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SPECIFIC PEPTIDE SUBSTRATE FOR HUMAN YAK1, YAK3A, AND YAK3B 
AND YEAST YAK1 PROTEIN KINASES 

Related Applications 

This application claims priority of U.S. Provisional Application Serial No. 60/163,901, 
5 filed on November 5, 1 999. 

Field of the Invention 

This invention relates to a newly identified peptide substrate for the serine/threonine 
protein kinase family members, human YAK1, human YAK3a and hYAK3b, and yeast YAK1. 
More particularly, the peptide of the present invention is derived from the carboxyl terminal end of 
10 bovine myelin basic protein (MBP). It is a 15 amino acid peptide with Ser at position 164 of the 

bovine MBP sequence as the major site of phosphorylation by hYAKl, hYAK3a and hYAK3b, and 
yeast YAK1, hereinafter referred to as Serl64. The sequence of Ser 164 is: 

Leu-Gly-Gly-Arg-Asp-Ser-Arg-Ser-Gly-Ser-Pro-Met-Ala-Arg-Arg (SEQ ID NO:9) 

155 160 164 169 

15 The invention also relates to a method of screening for inhibitors of human YAK1 and 

YAK3a and YAK3b, and yeast YAK1 protein serine/threonine kinases, and to inhibiting or 
activating the action of the above mentioned protein kinases and other YAK1 homologs by agents 
discovered using Ser 164 or related peptide analogs as a method of screening for inhibitors and ' 
activators. In addition, this invention relates to using the Ser 164 peptide to search for cellular 

20 substrates and/or cellular proteins that interact with the YAK family members of STPK's. 

Background of the Invention 

A number of polypeptide growth factors and hormones mediate their cellular effects through a 
signal transduction pathway. Transduction of signals from the cell surface receptors for these ligands 
to intracellular effectors frequently involves phosphorylation or dephosphorylation of specific protein 

25 substrates by regulatory protein serine/threonine kinases (PSTK) and phosphatases. Serine/threonine 
phosphorylation is a major mediator of signal transduction in multicellular organisms. Receptor- 
bound, membrane-bound and intracellular PSTKs regulate cell proliferation, cell differentiation and 
signaling processes in many cell types. 

Aberrant protein serine/threonine kinase activity has been implicated or is suspected in a 

30 number of pathologies such as rheumatoid arthritis, psoriasis, septic shock, bone loss, many cancers 
and other proliferative diseases. Accordingly, serine/threonine kinases and the signal transduction 
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pathways which they are part of are potential targets for drug design. 

Several PSTKs are involved in regulation of cell cycling. One example is the cyclin- 
dependent kinases or CDKs (Peter and Herskowitz, Cell 1994: 79, 181-184), which are activated by 
binding to regulatory proteins called cyclins and control passage of the cell through specific cell cycle 
5 checkpoints. For example, CDK2 complexed with cyclin E allows cells to progress through the Gl to 
S phase transition. The complexes of CDKs and cyclins are subject to inhibition by low molecular 
weight proteins such as pi 6 (Serrano et al, Nature 1993: 366, 704), which binds to and inhibits CDK4. 
Deletions or mutations in pi 6 have been implicated in a variety of tumors (Kamb et al, Science 1994: 
264, 436-440). Therefore, the proliferative state of cells and diseases associated with this state are 
10 dependent on the activity of CDKs and their associated regulatory molecules. In diseases such as 
cancer where inhibition of proliferation is desired, compounds that inhibit CDKs may be useful 
therapeutic agents. Conversely, activators of CDKs may be useful where enhancement of proliferation 
is needed, such as in the treatment of immunodeficiency. 

The yeast YAK1 is a PSTK with sequence homology to CDKs. It was originally identified in 
15 S. cerevisiae as a mediator of cell cycle arrest caused by inactivation of the cAMP-dependent protein 
kinase PKA (Garrett et al, Mol Cell Biol. 1991 : 1 1, 4045^052). YAK1 kinase activity is low in 
cycling yeast but increases dramatically when the cells are arrested prior to the S-G2 transition. 
Increased expression of YAK 1 causes growth arrest in yeast cells deficient in PKA. Therefore, YAK1 
can act as a cell cycle suppressor in yeast. 

20 Frequently, in diseases such as osteoporosis and osteoarthritis, patients have established 

lesions of bone or cartilage, respectively. Treatment of such lesions requires an agent that will 
stimulate new bone or cartilage formation to replace that lost to the disease; therefore, there is a need 
for drugs that increase the number of osteoblasts or chondrocytes, the cells responsible for bone or 
cartilage formation, respectively. Similarly, replacement of heart or skeletal muscle depleted by 

25 diseases such as myocardial infarction or HTV-associated cachexia requires drugs that stimulate 
proliferation of cardiac myocytes or skeletal myoblasts. 

This indicates that these protein serine/threonine kinases have an established, proven history 
as therapeutic targets. Clearly there is a need for identification and characterization of further 
members of the protein serine/threonine kinase family which can play a role in preventing, 
30 ameliorating or correcting dysfunctions or diseases, including, but not limited to, bone loss including 
osteoporosis; inflammatory diseases such as Adult Respiratory Disease Syndrome (ARDS), 
Rheumatoid arthritis, Osteoarthritis, Inflammatory Bowel Disease (IBD), psoriasis, dermatitis, asthma, 
allergies; infections such as bacterial, fungal, protozoan and viral infections, particularly infections 
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caused by HTV-1 or HIV-2; HIV-associated cachexia and other immunodeficiency disorders; septic 
shock; pain; injury; cancers; anorexia; bulimia; Parkinsons disease; cardiovascular disease including 
restenosis, atherosclerosis, acute heart failure, myocardial infarction; hypotension; hypertension; 
urinary retention; angina pectoris; ulcers; benign prostatic hypertrophy; and psychotic and 
neurological disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation and dyskinesias, such as Huntington's disease or Gilles dela Tourett's syndrome. 

Previously, a novel human homolog of yeast YAK1, termed hYAKl, which is expressed in 
osteoblasts, chondrocytes, cardiac and skeletal muscle, and at lower levels, in placenta and pancreas 
was described (CREASY et al., EP Publication Number 98301 124.8, published February 16, 1998, 
the disclosure of which is incorporated herein by reference in its entirety). The sequence of hYAKl 
shares homology with predicted PSTK's from C. elegans, S. pombe and S. cerevisiae and has motifs 
associated with known protein kinases. Inhibitors of hYAKl are expected to stimulate proliferation of 
cells in which it is expressed. 

Another novel human homolog of yeast YAK1 , termed hYAK3b, which is expressed in testis, 
skeletal muscle, and in hematopoietic cells that can undergo erythroid differentiation (XIE, and 
CREASY, U.S. Patent Number 5,965,420, granted October 12, 1999, and U.S. Application Serial 
Number 09/359,257, filed July 22, 1999, the disclosures of which are incorporated herein by 
reference in their entireties). The sequence of hYAK3b shares 79% amino acid identity with hYAKl 
in its kinase domain, and shares homology with predicted PSTK's from C elegans, S. pombe and S. 
cerevisiae. 

hYAKl and hYAK3b are members of a novel subfamily of protein kinases with unique 
structural and enzymatic features. This family had been termed Dyrk (for dual specificity YAK- 
related kinases) by Dr. Joost and his co-workers. In their publications, hYAKl and hYAK3b were 
termed Dyrk2 and Dyrk3, respectively (Becker and Joost, Prog. Nucl. Acid Res., 1999, 62, 1-17). 

Summary of the Invention 

In one aspect, the invention relates to a peptide substrate for hYAKl, hYAK3a and 
hYAK3b and yeast YAK1 . This peptide, which has the sequence LGGRDSRSGSPMARR(SEQ ID 
NO:9), is derived from the carboxyl terminal end of bovine MBP (residues 155-169), and in it the 
Ser residue at position 164 (underlined) was identified as the major phosphoacceptor residue for 
hYAKl . Hereinafter this peptide is referred to as Serl64. This same residue was identified as the 
major phosphoacceptor site also for hYAK3a and 3b and yeast YAK1 PSTKs. One aspect of the 
invention relates to methods for using the Ser 1 64 peptide as a substrate to screen for inhibitors of the 
kinase activity of hYAKl , hYAK3a, hYAK3b, yeast YAK1, and other members of this family of 
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protein kinases. Such uses include the treatment of bone loss including osteoporosis; inflammatory 
diseases such as Adult Respiratory Disease Syndrome (ARDS), Rheumatoid arthritis, Osteoarthritis, 
Inflammatory Bowel Disease (IBD), psoriasis, dermatitis, asthma, allergies; infections such as 
bacterial, fungal, protozoan and viral infections, particularly infections caused by HIV-1 or HIV-2; 
5 HTV-associated cachexia and other immunodeficiency disorders; septic shock; pain; injury; cancers; 
anorexia; bulimia; Parkinson's disease; cardiovascular disease including restenosis, atherosclerosis, 
acute heart failure, myocardial infarction; hypotension; hypertension; urinary retention; angina 
pectoris; ulcers; benign prostatic hypertrophy; and psychotic and neurological disorders, including 
anxiety, schizophrenia, manic depression, delirium, dementia, severe mental retardation and 

10 dyskinesias, such as Huntington's disease or Gilles dela Tourett's syndrome, among others. In still 
another aspect, the invention relates to methods to identify agonists and antagonists using the 
materials provided by the invention, and treating conditions associated with hYAKl, hYAK3a and 
3b, and yeast YAK1 imbalance with the identified compounds. Yet another aspect of the invention 
relates to diagnostic assays for detecting diseases associated with inappropriate hYAKl, hYAK3a and 

15 3b, and yeast YAK1 activity or levels. Still another aspect of the invention is the use of the SI 64 
peptide sequence to identify cellular proteins that are phosphorylated by or interact with hYAKl, 
hYAK3a and 3b, and yeast YAK1. 

Brief Description of the Drawings 

Figure 1 shows the determination of hYAKl -catalyzed phosphorylation sites on bovine 

20 MBP. 

Figure 2 shows the activity of hYAKl, hYAK3b, and yeast YAK1 against bovine MBP and 
the SI 64 peptide. 

Figure 3 shows the results of steady state two substrate analysis to determine the kinetic 
constants of the kinase reaction of purified recombinant hYAKl using the Serl64 as substrate. 

25 Description of the Invention 

Definitions 

The following definitions are provided to facilitate understanding of certain terms used 
frequently herein. 

"hYAKl" refers, among others, generally to a polypeptide having the amino acid sequence 
30 set forth in SEQ ED NO:4 or an allelic variant thereof. 
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"hYAKl activity or hYAKl polypeptide activity" or "biological activity of the hYAKl or 
hYAKl polypeptide" refers to the metabolic or physiologic function of said hYAKl including 
similar activities or improved activities or these activities with decreased undesirable side-effects. 
Also included are antigenic and immunogenic activities of said hYAKl. 

"hYAKl gene" refers to a polynucleotide having the nucleotide sequence set forth in SEQ 
ED NO:3 or allelic variants thereof and/or their complements. 

"hYAK3a" refers, among others, generally to a polypeptide having the amino acid sequence 
set forth in SEQ ID NO:6 or an allelic variant thereof. 

"hYAK3a activity or hYAK3a polypeptide activity" or "biological activity of the hYAK3a 
or hYAK3a polypeptide" refers to the metabolic or physiologic function of said hYAK3a including 
similar activities or improved activities or these activities with decreased undesirable side-effects. 
Also included are antigenic and immunogenic activities of said hYAK3a. 

"hYAK3a gene" refers to a polynucleotide having the nucleotide sequence set forth in SEQ 
ID NO:5 or allelic variants thereof and/or their complements. 

"hYAK3b" refers, among others, generally to a polypeptide having the amino acid sequence 
set forth in SEQ ID NO: 8 or an allelic variant thereof. 

"hYAK3b activity or hYAK3b polypeptide activity" or "biological activity of the hYAK3b 
or hYAK3b polypeptide" refers to the metabolic or physiologic function of said hYAK3b including 
similar activities or improved activities or these activities with decreased undesirable side-effects. 
Also included are antigenic and immunogenic activities of said hYAK3b. 

"hYAK3b gene" refers to a polynucleotide having the nucleotide sequence set forth in SEQ 
ID NO:7 or allelic variants thereof and/or their complements. 

"Yeast YAK1" refers, among others, generally to a polypeptide having the amino acid 
sequence set forth in SEQ ID NO:2 or an allelic variant thereof. 

"Yeast YAK1 activity or yYAKl polypeptide activity" or "biological activity of the yYAKl 
or yYAKl polypeptide" refers to the metabolic or physiologic function of said yYAKl including 
similar activities or improved activities or these activities with decreased undesirable side-effects. 
Also included are antigenic and immunogenic activities of said yYAKl. 

"Yeast YAK1 gene" refers to a polynucleotide having the nucleotide sequence set forth in 
SEQ ID NO:l or allelic variants thereof and/or their complements. 
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"Antibodies" as used herein includes polyclonal and monoclonal antibodies, chimeric, 
single chain, and humanized antibodies, as well as Fab fragments, including the products of an Fab 
or other immunoglobulin expression library. 

"Isolated" nieans altered "by the hand of man" from the natural state. If an "isolated" 
5 composition or substance occurs in nature, it has been changed or removed from its original 

environment, or both. For example, a polynucleotide or a polypeptide naturally present in a living 
animal is not "isolated," but the same polynucleotide or polypeptide separated from the coexisting 
materials of its natural state is "isolated", as the term is employed herein. 

"Polynucleotide" generally refers to any polyribonucleotide or polydeoxyribonucleotide, 
10 which may be unmodified RNA or DNA or modified RNA or DNA. "Polynucleotides" include, 
without limitation single- and double-stranded DNA, DNA that is a mixture of single- and double- 
stranded regions, single- and double-stranded RNA, and RNA that is mixture of single- and double- 
stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more 
typically, double-stranded or a mixture of single- and double-stranded regions. In addition, 
15 "polynucleotide" refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA. 
The term polynucleotide also includes DNAs or RNAs containing one or more modified bases and 
DNAs or RNAs with backbones modified for stability or for other reasons. "Modified" bases 
include, for example, tritylated bases and unusual bases such as inosine. A variety of modifications 
has been made to DNA and RNA; thus, "polynucleotide" embraces chemically, enzymatically or 
20 metabolically modified forms of polynucleotides as typically found in nature, as well as the 
chemical forms of DNA and RNA characteristic of viruses and cells. "Polynucleotide" also 
embraces relatively short polynucleotides, often referred to as oligonucleotides. 

~ "Polypeptide" refers to any peptide or protein comprising two or more amino acids joined 

to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. "Polypeptide" 

25 refers to both short chains, commonly referred to as peptides, oligopeptides or oligomers, and to 
longer chains, generally referred to as proteins. Polypeptides may contain amino acids other than 
the 20 gene-encoded amino acids. "Polypeptides" include amino acid sequences modified either by 
natural processes, such as posttranslational processing, or by chemical modification techniques 
which are well known in the art. Such modifications are well-described in basic texts and in more 

30 detailed monographs, as well as in a voluminous research literature. Modifications can occur 
anywhere in a polypeptide, including the peptide backbone, the amino acid side-chains and the 
amino or carboxyl termini. It will be appreciated that the same type of modification may be present 
in the same or varying degrees at several sites in a given polypeptide. Also, a given polypeptide 
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may contain many types of modifications. Polypeptides may be branched as a result of 
ubiquitination, and they may be cyclic, with or without branching. Cyclic, branched and branched 
cyclic polypeptides may result from posttranslation natural processes or may be made by synthetic 
methods. Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or 
nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of 
phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, formation 
of covalent cross-links, formation of cystine, formation of pyroglutamate, formylation, gamma- 
carboxylation, glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, 
myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins such as 
arginylation, and ubiquitination. See, for instance, PROTEINS - STRUCTURE AND 
MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman and Company, New 
York, 1993 and Wold, F., Posttranslational Protein Modifications: Perspectives and Prospects, pgs. 
1-12 in POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. 
Johnson, Ed., Academic Press, New York, 1983; Seifter et al, "Analysis for protein modifications 
and nonprotein cofactors", Meth Enzymol (1990) 182:626-646 and Rattan et al, "Protein Synthesis: 
Posttranslational Modifications and Aging", Ann NY Acad Sci (1992) 663:48-62. 

"Variant" as the term is used herein, is a polynucleotide or polypeptide that differs from a 
reference polynucleotide or polypeptide respectively, but retains essential properties. A typical 
variant of a polynucleotide differs in nucleotide sequence from another, reference polynucleotide. 
Changes in the nucleotide sequence of the variant may or may not alter the amino acid sequence of 
a polypeptide encoded by the reference polynucleotide. Nucleotide changes may result in amino 
acid substitutions, additions, deletions, fusions and truncations in the polypeptide encoded by the 
reference sequence, as discussed below. A typical variant of a polypeptide differs in amino acid 
sequence from another, reference polypeptide. Generally, differences are limited so that the 
sequences of the reference polypeptide and the variant are closely similar overall and, in many 
regions, identical. A variant and reference polypeptide may differ in amino acid sequence by one or 
more substitutions, additions, or deletions in any combination. A substituted or inserted amino acid 
residue may or may not be one encoded by the genetic code. A variant of a polynucleotide or 
polypeptide may be a naturally occurring such as an allelic variant, or it may be a variant that is not 
known to occur naturally. Non-naturally occurring variants of polynucleotides and polypeptides 
may be made by mutagenesis techniques or by direct synthesis. 
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"Identity" reflects a relationship between two or more polypeptide sequences or two or 
more polynucleotide sequences, determined by comparing the sequences. In general, identity refers 
to an exact nucleotide to nucleotide or amino acid to amino acid correspondence of the two 
polynucleotide or two polypeptide sequences, respectively, over the length of the sequences being 
5 compared. For sequences where there is not an exact correspondence, a M % identity 11 may be 
determined. In general, the two sequences to be compared are aligned to give a maximum 
correlation between the sequences. This may include inserting "gaps" in either one or both 
sequences, to enhance the degree of alignment. A % identity may be determined over the whole 
length of each of the sequences being compared (so-called global alignment), that is particularly 
10 suitable for sequences of the same or very similar length, or over shorter, defined lengths (so-called 
local alignment), that is more suitable for sequences of unequal length. 

"Similarity" is a further, more sophisticated measure of the relationship between two 
polypeptide sequences. In general, "similarity" means a comparison between the amino acids of 
two polypeptide chains, on a residue by residue basis, taking into account not only exact 
15 correspondences between a between pairs of residues, one from each of the sequences being 
compared (as for identity) but also, where there is not an exact correspondence, whether, on an 
evolutionary basis, one residue is a likely substitute for the other. This likelihood has an associated 
'score* from which the "% similarity" of the two sequences can then be determined. 

Methods for comparing the identity and similarity of two or more sequences are well known 
20 in the art. Thus for instance, programs available in the Wisconsin Sequence Analysis Package, 
version 9.1 (Devereux J., et al, Nucleic Acids Res, 12, 387-395, 1984, available from Genetics 
Computer Group, Madison, Wisconsin, USA), for example the programs BESTFIT and GAP, may 
be used to determine the % identity between two polynucleotides and the % identity and the % 
similarity between two polypeptide sequences. BESTFIT uses the "local homology" algorithm of 
25 Smith and Waterman (J. Mol. Biol, 147:195-197, 1981, Advances in Applied Mathematics, 2, 482- 
489, 1981) and finds the best single region of similarity between two sequences. BESTFIT is more 
suited to comparing two polynucleotide or two polypeptide sequences that are dissimilar in length, 
the program assuming that the shorter sequence represents a portion of the longer. In comparison, 
GAP aligns two sequences, finding a "maximum similarity", according to the algorithm of 
30 Neddleman and Wunsch (J. Mo.l Biol, 48, 443-453, 1970). GAP is more suited to comparing 
sequences that are approximately the same length and an alignment is expected over the entire 
length. Preferably, the parameters "Gap Weight" and "Length Weight" used in each program are 50 
and 3, for polynucleotide sequences and 12 and 4 for polypeptide sequences, respectively. 
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Preferably, % identities and similarities are determined when the two sequences being compared are 
optimally aligned. 

Other programs for determining identity and/or similarity between sequences are also 
known in the art, for instance the BLAST family of programs (Altschul S.F., et al., J. Mol. Biol., 

5 215, 403-410, 1990, Altschul S.F., et al, Nucleic Acids Res., 25:389-3402, 1997, available from the 
National Center for Biotechnology Information (NCBI), Bethesda, Maryland, USA and accessible 
through the home page of the NCBI at www.ncbi.nlm.nih.gov) and FASTA (Pearson W R, Methods 
in Enzymology, 1 83: 63-99 (1990); Pearson W R and Lipman D.J., Proc Nat Acad Sci USA. 85: 
2444-2448 (1988) (available as part of the Wisconsin Sequence Analysis Package). 

10 Preferably, the BLOSUM62 amino acid substitution matrix (Henikoff S. and Henikoff J.G., 

Proc. Nat. Acad Sci. USA, 89: 10915-10919 (1992)) is used in polypeptide sequence comparisons 
including where nucleotide sequences are first translated into amino acid sequences before 
comparison. 

Preferably, the program BESTFIT is used to determine the % identity of a query 

1 5 polynucleotide or a polypeptide sequence with respect to a polynucleotide or a polypeptide 

sequence of the present invention, the query and the reference sequence being optimally aligned and 
the parameters of the program set at the default value, as hereinbefore described. 
Alternatively, for instance, for the purposes of interpreting the scope of a claim including mention 
of a "% identity" to a reference polynucleotide, a polynucleotide sequence having, for example, at 

20 least 95% identity to a reference polynucleotide sequence is identical to the reference sequence 
except that the polynucleotide sequence may include up to five point mutations per each 100 
nucleotides of the reference sequence. Such point mutations are selected from the group consisting 
of at least one nucleotide deletion, substitution, including transition and transversion, or insertion. 
These point mutations may occur at the 5' or 3' terminal positions of the reference polynucleotide 

25 sequence or anywhere between these terminal positions, interspersed either individually among the 
nucleotides in the reference sequence or in one or more contiguous groups within the reference 
sequence. In other words, to obtain a polynucleotide sequence having at least 95% identity to a 
reference polynucleotide sequence, up to 5% of the nucleotides of the in the reference sequence 
may be deleted, substituted or inserted, or any combination thereof, as hereinbefore described. The 

30 same applies mutatis mutandis for other % identities such as 96%, 97%, 98%, 99% and 1 00%. 

For the purposes of interpreting the scope of a claim including mention of a "% identity" to 
a reference polypeptide, a polypeptide sequence having, for example, at least 95% identity to a 
reference polypeptide sequence is identical to the reference sequence except that the polypeptide 
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sequence may include up to five point mutations per each 100 amino acids of the reference 
sequence. Such point mutations are selected from the group consisting of at least one amino acid 
deletion, substitution, including conservative and non-conservative substitution, or insertion. These 
point mutations may occur at the amino- or carboxy-terminal positions of the reference polypeptide 
5 sequence or anywhere between these terminal positions, interspersed either individually among the 
amino acids in the reference sequence or in one or more contiguous groups within the reference 
sequence. In other words, to obtain a sequence polypeptide sequence having at least 95% identity 
to a reference polypeptide sequence, up to 5% of the amino acids of the in the reference sequence 
may be deleted, substituted or inserted, or any combination thereof, as hereinbefore described. The 
10 same applies mutatis mutandis for other % identities such as 96%, 97%, 98%, 99%, and 100%. 

A preferred meaning for "identity" for polynucleotides and polypeptides, as the case may be, 
are provided in (1) and (2) below. 

(1) Polynucleotide embodiments further include an isolated polynucleotide comprising a 
polynucleotide sequence having at least a 95, 97 or 100% identity to the reference sequence of SEQ 

15 ID NO:l, (or SEQ ID NOS: 3, 5, or 7) wherein said polynucleotide sequence may be identical to the 
reference sequence of SEQ ID NO: 1 or may include up to a certain integer number of nucleotide 
alterations as compared to the reference sequence, wherein said alterations are selected from the 
group consisting of at least one nucleotide deletion, substitution, including transition and transversion, 
or insertion, and wherein said alterations may occur at the 5' or 3 1 terminal positions of the reference 

20 nucleotide sequence or anywhere between those terminal positions, interspersed either individually 
among the nucleotides in the reference sequence or in one or more contiguous groups within the 
reference sequence, and wherein said number of nucleotide alterations is determined by multiplying 
th e to tal number of nu cleoti des in SEQ ID N O: 1 b y the integer defining the pe rcent identity divided 
by 100 and then subtracting that product from said total number of nucleotides in SEQ ID NO:l, or: 

25 n n £ x n - (x n • y), 

wherein n n is the number of nucleotide alterations, x n is the total number of nucleotides in SEQ ID 

NO:l, y is 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is the symbol for the multiplication 
operator, and wherein any non-integer product of x n and y is rounded down to the nearest integer 

prior to subtracting it from x n . Alterations of a polynucleotide sequence encoding the polypeptide of 

30 SEQ ID NO:2 may create nonsense, missense or frameshift mutations in this coding sequence and 
thereby alter the polypeptide encoded by the polynucleotide following such alterations. 
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(2) Polypeptide embodiments further include an isolated polypeptide comprising a 
polypeptide having at least a 95, 97 or 100% identity to a polypeptide reference sequence of SEQ ID 
NO:2, (or SEQ ID NOS: 4, 6, or 8-18)wherein said polypeptide sequence may be identical to the 
reference sequence of SEQ ID NO:2 or may include up to a certain integer number of amino acid 
alterations as compared to the reference sequence, wherein said alterations are selected from the 
group consisting of at least one amino acid deletion, substitution, including conservative and non- 
conservative substitution, or insertion, and wherein said alterations may occur at the amino- or 
carboxy-terminal positions of the reference polypeptide sequence or anywhere between those terminal 
positions, interspersed either individually among the amino acids in the reference sequence or in one 
or more contiguous groups within the reference sequence, and wherein said number of amino acid 
alterations is determined by multiplying the total number of amino acids in SEQ ID NO:2 by the 
integer defining the percent identity divided by 100 and then subtracting that product from said total 
number of amino acids in SEQ ID NO:2, or: 

n a SXa- (x a • y), 

wherein n a is the number of amino acid alterations, x a is the total number of amino acids in SEQ ID 
NO:2, y is 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is the symbol for the multiplication 
operator, and wherein any non-integer product of x a and y is rounded down to the nearest integer 
prior to subtracting it from x a . 

Peptides of the Invention 

In one aspect, the present invention relates to the Serl 64 peptides. The Serl64 peptides include 
the peptide of the sequence in Figure 3; as well as peptide analogs which were made to study the 
structure activity relationship of Serl64 phosphorylation by these PSTK's. These analogs include but are 
not limited to peptides 1 to 8 below (modified residues are bolded): 

Leu-Gly-Gly-Arg-Asp-Ser-Arg-Ser-Gly-Ser-Pro-Met-Ala-Arg-Arg (Serl64) (SEQ IDNO:9) 
Leu-Gly-Gly-Arg-Asp-Ser-Arg-Ser-Gly-Ser-Ala-Met-Ala-Arg-Arg (Peptide 1) (SEQ ID NO:10) 
Leu-Gly-Gly-Arg-Asp-Ser-Arg-Ser-Gly-Thr-Pro-Met-Ala-Arg-Arg (Peptide 2) (SEQ ID NO:l 1) 
Leu-Gly-Gly-Arg-Asp-Ser-Arg-Ser-Gly-Ser-Pro-Pro-Ala-Arg-Arg (Peptide 3) (SEQ ID NO: 1 2) 
Leu-Gly-Gly-Arg-Asp-Ser-Arg-Ser-Pro-Ser-Pro-Met-Ala-Arg-Arg (Peptide 4) (SEQ ID NO: 1 3) 
Leu-Gly-Gly-Arg-Asp-Ser-Arg-Ser-Pro-Ser-Ala-Met-Ala-Arg-Arg (Peptide 5) (SEQ ID NO: 14) 
Leu-Gly-Gly-Arg-Asp-Ser-Arg-Ala-Gly-Ser-Pro-Met-Ala-Arg-Arg (Peptide 6) (SEQ ID NO:15) 
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Leu-Gly-Gly-Arg-Asp-Ala-Arg-Ser-Gly-Ser-Pro-Met-Ala-Arg-Arg (Peptide 7) (SEQ ID NO: 16) 

Leu-Gly-Gly-Arg-Asp-Ala-Arg-Ala-Gly-Ser-Pro-Met-Ala-Arg-Arg (Peptide 8) (SEQ ID NO: 17) 

Also included within the Serl64 peptides are truncated peptides analogs that do not have the first 
three residues (Leu-Gly-Gly). 

5 Detection of Serl64 as the site on MBP phosphorylated by the YAKs PSTKs: 

By screening several proteins as substrates, we determined that hYAKl, hYAK3a and 3b and 
yeast YAK1 preferred MBP as a substrate over casein, enolase, poly-Gly/Tyr, and several forms of 
histone. To identify the phosphorylation sites on bovine MBP catalyzed by each of the above 
mentioned protein kinases, bovine MBP was incubated for 30 minutes at 30 °C with 100 )iM ATP, 10 

10 mM MgC12, and purified hYAKl , purified hYAK3b, or yeast YAK 1 immunoprecipitated (using 
anti-HA Ab) from crude extracts of Saccharomyces cerevisiae transfected with HA-tagged yeast 
YAK1 . Such an in vitro phosphorylated MBP was digested with endoproteinase Lys-C for 2 hours 
at 38 °C in 50 mM Tris pH 8.8. An aliquot of the unfractionated protein digest was analyzed by 
MALDI TOF for the presence of phosphorylated peptides. MALDI mass spectra (Annan, and Carr, 

15 (1996) Anal. Chem. 68, 3413-3421) were recorded on a Micromass TofSpec SE single-stage, 
gridless reflectron time-of-flight mass spectrometer equipped with a time-lag-focusing source. 
Samples were prepared by mixing the digest with an equal volume of matrix solution which 
contained two internal mass standards at a concentration of 200 frnol/|il. A 0.5 jil aliquot of this 
sample/matrix solution was applied to the MALDI target. The matrix solution was 10 ng/ml a- 

20 Cyano-4-hydroxycinnamic acid in 50:50 ethanol:acetonitrile. Samples were then irradiated by 337 
nm photons from a pulsed Laser Science nitrogen laser operating at 5 Hz. Typically, 20-50 laser 
shots were summed into a single mass spectrum. Potential phosphopeptide precursor ions were 
selected for post-source-decay (PSD) analysis using a Bradbury-Nielsen ion gate. Phosphopeptides 
were isolated by reversed phase HPLC using a 1mm i.d. CI 8 column. Peptides are eluted using 

25 acetonitrile:water:0.1% trifluoroacetic acid gradients at 50 fil/min. The column eluent was split 
post detector such that 5 jil/min was sent to a Micromass QTOF quadrupole time-of-flight mass 
spectrometer, and 45 fal/min went to a fraction collector taking one minute fractions. 
Phosphopeptides were sequenced by tandem MS (Verma, et al. (1997) Science, 278, 455-560.) 
using the QTOF mass spectrometer equipped with a nanoelectrospray source (Wilm, and Mann, 

30 (1994) Analytical Chem. 66, 4390-4399 1994). 

MALDI-TOF analysis of 5% of the unfractionated endoproteinase Lys-C digests from 
phosphorylated and nonphosphorylated MBP revealed 2 apparent phosphate-containing peptides at 
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m/z 1571.78 (MH+) and 1682.80 (MH+) (Fig. l(labelled A)), corresponding to peptides of 
molecular mass 1570.77 and 1681 .79, respectively. The 1570 Da species is consistent with the 
mass of a peptide containing residues 91-104 from MBP plus one mole of phosphate (calculated 
mass, 1570.78 Da), whereas the 1681 Da peptide was consistent with the mass of the C-terminal 
Lys-C fragment from MBP incorporating residues 155-169 plus one mole of phosphate (calculated 
mass, 1681 .78). Observation of the characteristic loss of H 3 P0 4 (98 Da) from both peptides upon 
subsequent PSD analysis (Annan, and Carr, (1996) Anal. Chem. 68, 3413-3421) confirmed that 
these peptides were, in fact, phosphorylated. 

The Lys-C digests were fractionated by reverse phase HPLC using MS as an on-line 
readout of the fractions. Fractions containing the two putative phosphopeptides were analyzed by 
nanoES, and the appropriate precursor ions sequenced by collision-induced dissociation tandem 
mass spectrometry. The spectrum obtained for the 1570.9 peptide (Fig. 1 (labelled B)) is consistent 
with two phosphorylated forms of the sequence NTVTPRTPPPSQGK (SEQ ID NO:18), which 
corresponds to residues 91-104 of bovine MBP. The major phosphorylation site was found to be 
Thr97. The 181.04 Da mass difference between the b 6 and b 7 ions indicates the presence of 
phosphothreonine at this position. A very small amount of phosphorylation was also found at 
Thr94. The evidence for phosphorylation at this second site was a series of ^ ions starting at b 4 
and ending at b 6 . (The evidence for phosphorylation at either site would converge at the b7 ion.) 

The tandem mass spectrum of the 1681 peptide (Fig. l(labelled C)) is consistent with a 
monophosphorylated form of LGGRDSRSGSPMARR (SEQ ID NO:9), corresponding to the C- 
terminal residues 155-169 of bovine MBP. Although the tandem MS data clearly indicate that Serl60 
is not phosphorylated, it was unable to distinguish between Serl62 or Serl64 as the sole site of 
phosphorylation in this peptide. Assignment of the phosphorylation site as Serl64 is based on the 
presence of an S/TP sequence at this site, which is analogous to the motif found at Thr94 and Thr97. 

Phosphorylation of bovine MBP on Thr97 by the MAP kinases has been previously 
reported (Erikson, et al., (1990) J. Biol. Chem. 265, 19728-1 9735 ;Clark-Lewis, et al., (1991) J. 
Biol. Chem. 266, 1 1580-1 1584.). However, we are not aware of any kinase that phosphorylates 
bovine MBP on Serl64. Therefore, the phosphorylation pattern of yeast YAK1 on MBP appears to 
be unique. 

Kinase Assays Using Serl64 as the phosphoacceptor 

The source of Ser 164 peptide : It was custom-ordered from California Peptide Research Inc. 
(Napa, CA), and its purity was determined by HPLC. The peptide contained 15 amino acids, and its 
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calculated molecular mass was 1601.82 daltons. Solid sample was dissolved at 5 mM in ice-cold 
kinase assay buffer (see later), aliquoted, and stored at -20 °C until use. 

Figure 1 shows a determination of YAK 1 -induced phosphorylation sites on myelin basic protein. 
A, Partial MALDI mass spectrum of an unfractionated Lys-C digest of in vitro phosphorylated MBP. 
Peptides modified by a single mole of phosphate show an increase in mass of 80 Da. B, ES-CID tandem 
mass spectrum of the M 2+ ion (m/z 786.4) from the 1570 Da phosphorylated peptide shown above. Ions 
marked b n A have the structure b n -H3P04- C, ES-CID tandem mass spectrum of the M 3+ ion (m/z 

561.6) from the 1681 Da phosphorylated peptide shown above. Ions marked have the structure 

b n -H 3 P04-NH 3< 

Figure 2 shows the activity of hYAKl, hYAK3, and yeast YAK1 against MBP and the Serl64 
peptide. 5 ng purified hYAKl and 100 ng purified hYAK3 were used per assay. Anti-HA mAb immune 
5 complex kinase assay was performed on 100 jag protein from crude extracts of yeast cells expressing 
either FL or AN yeast YAK1 . Concentration of ATP was 100 |iM, Serl64 was used at 0.5 mM, and 
MBP was at 10 jag/reaction (18.5 uM). 

Figure 3 shows double reciprocal plots of enzyme velocity vs. ATP or Serl64 concentration. 
Steady state two substrate analysis of purified hYAKl using SI 64 as phosphoacceptor. 1 ng hYAKl was 
incubated for 30 minutes at 37 °C in hYAKl reaction mix that contains various concentrations of ATP 
and Serl64. Each ATP concentration was used with 1, 0.75, 0.5, 0.33, 0.25, 0.15, and 0.05 mM Serl64. 
Each Serl64 concentration was used with 333, 166, 100, 66.7, 33.3, 16.7, and 6.6 ^iM ATP. Reciprocal 
values of enzyme velocity were calculated and plotted against reciprocal values of the concentration of 
each substrate. Data were analyzed by GraphFit, and calculated kinetic constants are shown. 

The source of enzyme : 

1) hYAKl : DETl/DET2-tagged frill length hYAKl was expressed in Drosophila sf9 cells 
10 and purified to >95% purity using Ni column chromatography. The purified protein migrated on SDS 

gels as a single band with an apparent molecular mass of 62 kDa. Samples were stored at -80 °C until 
use. 

2) hYAK3b: Glutathione-S-Transferase (GST)/Factor Xa-tagged hYAK3b was expressed in 
baculovirus cells and purified to about 50% purity using Glutathione Sepharose 4B column 

1 5 chromatography, followed by Ni-NTA column chromatography. Samples were stored at -80 °C until 
use. 
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3) Yeast YAK1 : Full length and an amino-terrninally truncated (amino acids 148-807, termed 
AN) hemagglutinin (HA)-tagged yeast YAK1 was each expressed in a strain of & cerevisiae lacking 
the endogenous YAK1 gene and all three PKA genes. Cultures for experiments were grown in liquid 
Sc-His to an OD 60 o of at least 1.0, washed with Sc-His g/r, resuspended in Sc-His g/r to twice the 
original volume and grown for 16-24 hours at RT. Cells were washed once with H 2 0 and the 
pellets stored at -80 °C until use. To prepare lysates, cell pellets were thawed and resuspended at 1 
ml/100 ml of original culture in lysis buffer (LB) containing 50 mM Tris pH 7.5, 150 mM NaCl, 10 
pg/ml each aprotinin, leupeptin and TLCK, 0.1 mM PMSF, 50 mM NaF, 1 mM NaVanadate, 10 
mM p-glycerophosphate. Following the addition of 0.5 ml sterile acid-washed glass beads, cells 
were disrupted via ten, 30 second intervals of vortexing. NP40 was added to a 2% final 
concentration followed by rocking at 4 °C for 30-50 minutes. Lysates were clarified by high-speed 
centrifugation, and the supematants were stored at -80 °C until use. Each form of yeast YAK1 was 
immunoprecipitated from the detergent extracts using anti-HA mAb. 

Immune Complex Protein Kinase Assay fo r Yeast YAK1 : Yeast cellular extracts were 
immunoprecipitated by rocking overnight at 4°C with 4 pg of the anti-HA tag antibody and 100 pi 
of 20% suspension of protein A agarose (GIBCO-BRL) in LB that contained 1% NP-40. Samples 
were then washed twice with LB and once with basic kinase assay buffer (25 mM Hepes, pH 7.5; 1 
mM DTT; 10 mM p-glycerol phosphate; 0.2 mM NaV). Washed immune complexes were 
suspended in 20 ul of basic kinase assay buffer that contained 0.1 mM ATP, 3 pCi of [y- 32 P]ATP, 
10 mM MgCl 2 , plus either bovine MBP or the Serl64 peptide. After incubation for 15 minutes at 
30°C, the reactions were stopped by adding 20 pi of 0.15 M phosphoric acid. Phosphorylated 
substrates were isolated by spotting 20 pi of each sample on phosphocellulose (p81) filters. Filters 
were washed 3 times with 75 mM phosphoric acid followed by 3 times with H 2 0, and counted for 

32p incorporation using a 
P-scintillation counter. 

Kinase assay nf purified hYAKl and hYAK3 : The assay was performed in 96 well Minisorp 
plates (Costar, Catalog No 3356). Reaction (in 30 pi volume) mixtures contained in final 
concentrations, 25 mM Hepes buffer, pH 7.5; 0.2 mM sodium vandate; 10 mM MgCl 2 ; 1 mM DTT; 
10 mM p-glycerol phosphate; 0.1% BSA; 0.1 mM ATP, 2.5 pCi of [y-32 P ]ATP; purified hYAKl 
(1-5 ng/assay), or purified hYAK3b (50-100 ng/assay); and either bovine MBP or the Serl64 
peptide used at the concentrations indicated below and in the descriptions of the figures. Reactions 
were incubated for 20 minutes at 37°C, and were stopped by adding 10 pi of 0.3 M phosphoric acid. 
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Phosphorylated substrates were isolated by spotting 20 jal of the reaction on p81 filters, and 
processed as detailed earlier. 

This same assay can be performed on a FlashPlate format in which the plate is coated with 
MBP or with the SI 64 peptide by incubation overnight at 4 °C in 100 \x\ of either substrate 
5 dissolved in Sodium Carbonate buffer, pH 8.8. When coating with MBP, a solution of 100 fig/ml 
MBP was used to coat wells with 100 fil (10 jag) MBP per well. When coating with Ser 164, a 
solution of 0.4 mg/ml (0.25 mM) was used to coat wells with 100 jil (40 |ig) Serl64 per well. An 
example of a FlashPlate assay protocol and typical results are given below: 

FlashPlate (FF) Protocol 

10 1 . Coat Maxisorp plates with MBP or Ser 1 64 as above. 

2. Wash plates once with kinase assay buffer (KB): 25 mM Hepes, pH 7.5; 0.2 mM NaV; 10 mM P- 
glycerol phosphate; 1 mM Na pyrophosphate 

3. Add enzyme (Ni-hYAKl, diluted in KB), DMSO or inhibitors (in KB) and keep on ice for 30 
minutes 

15 4. Add KB containing Mg/ATP to a [final] of 0.1 mM [y- 33 P]ATP and 10 mM MgCl 2 

5. Incubate with shaking, 1-2 hrs, at room temperature 

6. Aspirate and wash 6 times with 0.5 ml KB 

7. Read 33 P incorporation in FP reader 

8. Blank = No enzyme added 

20 9. Reaction volume: 25, 50 or 100 ^il 

10. 0.5 or 1.0 ^iCi 33 P/0.1 mM ATP 

1 1 . MBP-FP better than basic FP (in house coating) 

12. 37 °C incubation was not better (several time points) 

13. Other incubation times at room temperature were not better 

25 . 

Results : Each kinase phosphorylated the Ser 164 peptide to much higher specific activity 
than MBP (Figure 2). Steady state kinetic constants of the hYAKl reaction were generated by 
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varying both substrates simultaneously and fitting enzyme velocity as a function of each substrate 
concentration. Double reciprocal plots (1/V vs. l/[Substrate]) with S164 peptide as the phosphate 
acceptor are shown in Figure 3. GraphFit analysis of the results generated the following steady 
state kinetic constants: 
5 K W IATP1=42 ±7 uM . 

K m |S1641=160 ±14 uM. 

\ max = 51 ±6 umol/mg. 

k C ar= 160 ±19 min " 1 " 
Typical results of FlashPlate are shown below 

10 FlashPlate Typical results 

Signal to noise ratio: >7 fold 

Blanks: 30-80 cpm 
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[Ni-hYAKl]: 

33p. 

Kinase inhibitors: 



20 Specific Activity: 



As low as 20 ng/reaction (5 nM) for 1 00 ul reactions 
As low as 8 ng/reaction (5 nM) for 25 pi reactions 
As low as 0.5 p.Ci 
Potency comparable to tube assay: 

SKF-108752 IC50: 0.19 uM (0.1 ug hYAKl) 

0.13 uM; 0.16 pM (0.3 pg hYAKl) 
K252a IC50: 0.552 uM; 0.427 uM (0.3 ug hYAKl) 

At 20 ng enzyme, MBP gave 58 ±3 (n = 6), and Serl64 gave 484 ±63 (n 
nmol/mg protein 



No effect up to 3% 
<10% (between wells and from plate to plate) 
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DMSO: 
Variability: 

Screening Assays 

The hYAKl, hYAK3a and 3b, and yeast YAK1 polypeptides of the present invention may 
employed in a screening process for compounds, which activate (agonists) or inhibit activation of 
(antagonists, or otherwise called inhibitors) the hYAKl, hYAK3a and 3b, and yeast YAK1 
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polypeptides of the present invention. Thus, polypeptides of the invention may also be used to assess 
or identify agonists or antagonists from, for example, cells, cell-free preparations, chemical libraries, 
and natural product mixtures. These agonists or antagonists may be natural substrates, ligands, 
receptors, etc., as the case may be, of the polypeptides of the present invention; or may be structural or 
5 functional mimetics of the polypeptide of the present invention. See Coligan et aL, Current Protocols 
in Immunology 1(2): Chapter 5 (1991). 

hYAKl, hYAK3a and 3b , and yeast YAK1 polypeptides are ubiquitous in the mammalian 
host and are responsible for many biological functions, including many pathologies. Accordingly, it is 
desirous to find compounds and drugs which stimulate hYAKl, hYAK3a and 3b, and yeast YAK1 

10 polypeptide on the one hand and which can inhibit the function of hYAKl , hYAK3a and 3b, and yeast 
YAK1 polypeptide on the other hand. In general, agonists are employed for therapeutic and 
prophylactic purposes for such conditions as bone loss including osteoporosis; inflammatory diseases 
such as Adult Respiratory Disease Syndrome (ARDS), Rheumatoid arthritis, Osteoarthritis, 
Inflammatory Bowel Disease (IBD), psoriasis, dermatitis, asthma, allergies; infections such as 

15 bacterial, fungal, protozoan and viral infections, particularly infections caused by HTV-l or HIV-2;' 
HTV-associated cachexia and other immunodeficiency disorders; septic shock; pain; injury; cancers; 
anorexia; bulimia; Parkinson's disease; cardiovascular disease including restenosis, atherosclerosis, 
acute heart failure, myocardial infarction; hypotension; hypertension; urinary retention; angina 
pectoris; ulcers; benign prostatic hypertrophy; and psychotic and neurological disorders, including 

20 anxiety, schizophrenia, manic depression, delirium, dementia, severe mental retardation and 

dyskinesias, such as Huntington's disease or Gilles dela Tourett's syndrome. Antagonists may be 
employed for a variety of therapeutic and prophylactic purposes for such conditions as bone loss 
including osteoporosis; inflammatory diseases such as Adult Respiratory Disease Syndrome (ARDS), 
Rheumatoid arthritis, Osteoarthritis, Inflammatory Bowel Disease (IBD), psoriasis, dermatitis, asthma, 

25 allergies; infections such as bacterial, fungal, protozoan and viral infections, particularly infections 
caused by HTV-l or HTV-2; HIV-associated cachexia and other immunodeficiency disorders; septic 
shock; pain; injury; cancers; anorexia; bulimia; Parkinson's disease; cardiovascular disease including 
restenosis, atherosclerosis, acute heart failure, myocardial infarction; hypotension; hypertension; 
urinary retention; angina pectoris; ulcers; benign prostatic hypertrophy; and psychotic and 

30 neurological disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation and dyskinesias, such as Huntington's disease or Gilles dela Tourett's syndrome. 

In general, such screening procedures may involve using appropriate cells which express the 
hYAKl, hYAK3a and 3b, and yeast YAK1 polypeptides or respond to hYAKl, hYAK3a and 3b, and 
yeast YAK1 polypeptide of the present invention. Such cells include cells from mammals, yeast, 
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Drosophila or E.coli. Cells which express the hYAKl, hYAK3a and 3b, and yeast YAKl 
polypeptides (or cell membrane containing the expressed polypeptides) or respond to hYAKl, 
hYAK3a and 3b, and yeast YAKl polypeptides are then contacted with a test compound to observe 
binding, or stimulation or inhibition of a functional response. The ability of the cells, which were 
contacted with the candidate compounds, is compared with the same cells which were not contacted 
for hYAKl , hYAK3a and 3b, and yeast YAKl activity. 

The knowledge that the hYAKl, hYAK3a and 3b, and yeast YAKl encodes a protein kinase 
suggests that recombinant forms can be used to establish a protein kinase activity. Typically this 
would involve the direct incubation of hYAKl , hYAK3a and 3b, and yeast YAKl with a protein or 
peptide substrate in the presence of Y - 32 P- ATP, followed by the measurement of radioactivity 
incorporated into the substrate by separation and counting. Separation methods include 
immunoprecipitation, conjugation of substrate to a bead allowing separation by centrifugation or 
determination of incorporation by scintillation proximity assay, SDS-PAGE followed by 
autoradiography or biosensor analysis. While the specific substrates are not yet known, candidates 
include hYAKl, hYAK3a and 3b, and yeast YAKl themselves (autophosphorylation), myelin basic 
protein, casein, histone and HSP27. Other substances might be discovered by incubating hYAKl, 
hYAK3a and 3b, and yeast YAKl with random peptides conjugated to solid supports or displayed on 
the surface of phage or by incubation of hYAKl, hYAK3a and 3b, and yeast YAKl with mammalian 
cell lysates and y- 32 P- ATP, followed by separation of the labelled target proteins, and sequencing. 
The protein kinase activity of hYAKl, hYAK3a and 3b, and yeast YAKl may require incubation with 
a specific upstream effector. This may be achieved by preincubating hYAKl , hYAK3a and 3b, and 
yeast YAKl with lysates from a variety of stimulated eukaryotic cells and ATP. These assays permit 
the discovery and modification of compounds which inhibit hYAKl, hYAK3a and 3b, and yeast 
YAKl kinase activity in vitro and would be expected to have effects on proliferation of osteoblasts, 
chondrocytes, cardiac myocytes or skeletal myoblasts. Any inhibitors so identified would be expected 
to have up-regulatory effects on proliferation and be useful as a therapeutic for the treatment and 
prevention of diseases such as osteoporosis, osteoarthritis, cardiomyopathy and cachexia. 

This invention contemplates the treatment and/or amelioration of such diseases by 
administering a hYAKl, hYAK3a and 3b, or yeast YAKl inhibiting amount of a compound. Without 
wishing to be bound by any particular theory of the functioning of the hYAKl, hYAK3a and 3b, and 
yeast YAKl polypeptides of this invention, it is believed that among the useful inhibitors of hYAKl, 
hYAK3a and 3b, and yeast YAKl function are those compounds which inhibit the kinase activity of 
the hYAKl , hYAK3a and 3b, and yeast YAKl . Other sites of inhibition are, of course, possible 
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owing to its position in a signal transduction cascade. Therefore, inhibiting the interaction of hYAKl, 
hYAK3a and 3b, and yeast YAK1 with one or more of its upstream or downstream 
modulators/substrates is also contemplated by this invention. Inhibitors of protein-protein interactions 
between hYAKl, hYAK3a and 3b, and yeast YAK1 and other factors could lead to the development 
5 of pharmaceutical agents for the modulation of hYAKl , hYAK3a and 3b, and yeast YAK1 activity. 

The assays may simply test binding of a candidate compound wherein adherence to the cells 
bearing the hYAKl, hYAK3a and 3b, and yeast YAK1 polypeptide is detected by means of a label 
directly or indirectly associated with the candidate compound or in an assay involving competition 
with a labeled competitor. Further, these assays may test whether the candidate compound results 
10 in a signal generated by activation of the hYAKl, hYAK3a and 3b, and yeast YAK1 polypeptide, 
using detection systems appropriate to the cells bearing the hYAKl, hYAK3a and 3b, and yeast 
YAK1 polypeptide. Inhibitors of activation are generally assayed in the presence of a known 
agonist and the effect on activation by the agonist by the presence of the candidate compound is 
observed. Standard methods for conducting such screening assays are well understood in the art. 

15 Examples of potential hYAKl , hYAK3a and 3b, and yeast YAK1 polypeptide antagonists 

include antibodies or, in some cases, oligonucleotides or proteins which are closely related to the 
ligands, substrates, receptors, etc., as the case may be, of the hYAKl, hYAK3a and 3b, and yeast 
YAK1 polypeptide, e.g., a fragment of the ligands, substrates, receptors, or small molecules which 
bind to the polypeptide of the present invention but do not elicit a response, so that the activity of the 

20 polypeptide is prevented. 

Prophylactic and Therapeutic Methods 

This invention provides methods of treating an abnormal condition related to both an excess 
of and insufficient amounts of hYAKl, hYAK3a and 3b, and yeast YAK1 polypeptide activity. 

If the activity of hYAKl, hYAK3a and 3b, and yeast YAK1 polypeptides is in excess, several 
25 approaches are available. One approach comprises administering to a subject an inhibitor compound 
(antagonist) as hereinabove described along with a pharmaceutically acceptable carrier in an amount 
effective to inhibit activation by blocking binding of ligands to the hYAKl, hYAK3a and 3b, and 
yeast YAK1 polypeptides, or by inhibiting a second signal, and thereby alleviating the abnormal 
condition. 

30 In another approach, soluble forms of hYAKl, hYAK3a and 3b, and yeast YAK1 

polypeptides still capable of binding the ligand in competition with endogenous hYAKl, hYAK3a 
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and 3b, and yeast YAK1 polypeptides may be administered. Typical embodiments of such 
competitors comprise fragments of the hYAKl, hYAK3a and 3b, and yeast YAK1 polypeptides. 

In still another approach, expression of the gene encoding endogenous hYAKl, hYAK3a 
and 3b, and yeast YAK1 polypeptide can be inhibited using expression blocking techniques. Known 
such techniques involve the use of antisense sequences, either internally generated or separately 
administered. See, for example, O'Connor, JNeurochem (1991) 56:560 in Oligodeoxynucleotides 
as Antisense Inhibit™-* nf Gene Expression . CRC Press, Boca Raton, FL (1988). Alternatively, 
oligonucleotides which form triple helices with the gene can be supplied. See, for example, Lee et 
at, Nucleic Acids Res (1979) 6:3073; Cooney et al, Science (1988) 241 :456; Dervan et ah, Science 
(1991) 25 1 :1360. These oligomers can be administered per se or the relevant oligomers can be 
expressed in vivo. 

For treating abnormal conditions related to an under-expression of hYAKl, hYAK3a and 3b, 
and yeast YAK1 and its activity, several approaches are also available. One approach comprises 
administering to a subject a therapeutically effective amount of a compound which activates hYAKl, 
hYAK3a and 3b, and yeast YAK1 polypeptide, i.e., an agonist as described above, in combination 
with a pharmaceutically acceptable carrier, to thereby alleviate the abnormal condition. Alternatively, 
gene therapy may be employed to effect the endogenous production of hYAKl, hYAK3a and 3b, and 
yeast YAK1 by the relevant cells in the subject. For example, a polynucleotide of the invention may 
be engineered for expression in a replication defective retroviral vector, as discussed above. The 
retroviral expression construct may then be isolated and introduced into a packaging cell transduced 
with a retroviral plasmid vector containing RNA encoding a polypeptide of the present invention such 
that the packaging cell now produces infectious viral particles containing the gene of interest. These 
producer cells may be administered to a subject for engineering cells in vivo and expression of the 
polypeptide in vivo. For overview of gene therapy, see Chapter 20, Gene Therapy and other 
Molecular Genetic-based Therapeutic Approaches, (and references cited therein) in Human Molecular 
Genetics, T Strachan and A P Read, BIOS Scientific Publishers Ltd (1996). 

Formulation and Administration 

Peptides, such as the soluble form of hYAKl, hYAK3a and 3b, and yeast YAK1 
polypeptides, and agonists and antagonist peptides or small molecules, may be formulated in 
combination with a suitable pharmaceutical carrier. Such formulations comprise a therapeutically 
effective amount of the polypeptide or compound, and a pharmaceutically acceptable carrier or 
excipient. Such carriers include but are not limited to, saline, buffered saline, dextrose, water, 
glycerol, ethanol, and combinations thereof. The formulation should suit the mode of administration, 



WO 01/32918 



PCTAJS00/30542 



- 22- 

and is well within the skill of the art. The invention further relates to pharmaceutical packs and kits 
comprising one or more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. 

Polypeptides and other compounds of the present invention may be employed alone or in 
5 conjunction with other compounds, such as therapeutic compounds. 

Preferred forms of systemic administration of the pharmaceutical compositions include 
injection, typically by intravenous injection. Other injection routes, such as subcutaneous, 
intramuscular, or intraperitoneal, can be used. Alternative means for systemic administration include 
transmucosal and transdermal administration using penetrants such as bile salts or fusidic acids or 
10 other detergents. In addition, if properly formulated in enteric or encapsulated formulations, oral 

administration may also be possible. Administration of these compounds may also be topical and/or 
localized, in the form of salves, pastes, gels and the like. 

The dosage range required depends on the choice of peptide, the route of administration, the 
nature of the formulation, the nature of the subject's condition, and the judgment of the attending 

1 5 practitioner. Suitable dosages, however, are in the range of 0. 1 - 1 00 |ig/kg of subject. Wide variations 
in the needed dosage, however, are to be expected in view of the variety of compounds available and 
the differing efficiencies of various routes of administration. For example, oral administration would 
be expected to require higher dosages than administration by intravenous injection. Variations in these 
dosage levels can be adjusted using standard empirical routines for optimization, as is well understood 

20 in the art. 

Polypeptides used in treatment can also be generated endogenously in the subject, in treatment 
modalities often referred to as "gene therapy" as described above. Thus, for example, cells from a 
subject may be engineered with a polynucleotide, such as a DNA or RNA, to encode a polypeptide ex 
vivo, and for example, by the use of a retroviral plasmid vector. The cells are then introduced into the 
25 subject. 

Examples 

The examples below are carried out using standard techniques, which are well known and 
routine to those of skill in the art, except where otherwise described in detail. The examples illustrate, 
but do not limit the invention. 

30 Example 1 

A partial clone (ATG-355, HGS EST # 454640) was initially identified through random 
searches of the Human Genome Sciences database. This partial clone (~ lkb) showed significant 
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homology to YAK1 from S cerevisiae. To get the full length cDNA: A total of 1M plaques were 
screened from a Human Osteoblast cDNA library (Stratagene, LaJolla CA) using the insert of the 
above partial clone as a probe. Library screening procedure is described by (Elgin, et al. Strategies 4: 
8-9, 1991). The probes were a -32P labeled, using Random Primed Labeling Kit (Boheringer 
Manheim, Germany, Cat. # 1 585584 ) and purified by running over Sephadex G-50 columns 
(Pharmacia Biotech. Cat. # 17-0855-02) The hybridization and washing conditions were according to 
( J. Sambrook, E.F. Fritch and T. Maniatis (1989) A Laboratory Manual Second. Ed. Vol. 1 pp. 2.69- 
2.8 1 Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York). Five clones were 
isolated by plaque purification and a fragments containing the inserts were excised and sequenced. The 
longest insert was a 2.7 kb fragment containing the 3' untranslated region and part of the coding 
sequence of hYAKl . A second probe prepared by EcoRI-Bamffl digestion of this insert followed by 
labelling with 32P was used to screen a commercially available human heart cDNA library (Stratagene 
#936207). An additional five clones were plaque purified, excised into phagemids and sequenced. 
Fasta analysis show this peptide to have high homology to a putative serine/threonine kinase of 
unknown function from C. elegans (F49E1 1.1). 

To confirm that the cDNA was full length, Human Leukocyte "Marathon Ready" cDNA 
(Clontech, Palo Alto, CA) was used as a template to amplify a fragment corresponding to the 5 1 region 
of hYAKl using a 5' anchor primer-1 (Clontech) and a reverse gene specific primer. This fragment 
was T/A cloned into pCR2.1 (Invitrogen), and multiple isolates were sequenced. An in-frame stop 
codon was identified upstream of the predicted initiation codon confirming that the full-length cDNA 
had been obtained. 

Northern analysis was carried out to determine the distribution of hYAKl mRNA in human 
tissues. A fragment containing the 3' untranslated region of hYAKl was isolated from SEQ ID NO: 1 
using standard techniques. The isolated fragment was radiolabelled with a-3 2 P-dATP using a 
randomly primed labelling kit. Membranes containing mRNA from multiple human tissues (Clontech 
#7760-1) were hybridized with the probe and washed under high stringency conditions as directed. 
Hybridized mRNA was visualized by exposing the membranes for 4 days to X-ray film. Three major 
transcripts of 2.6, 7 and 10 kb were present and were expressed most prominently in heart and skeletal 
muscle, but were present to a lesser degree in pancreas, placenta and brain. All three transcripts 
appeared absent from kidney. 
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SEQUENCE INFORMATION 
SEQ ID NO:l 

ATG AACT CAT CCAATAATAA CGACTCGTCC 
AGCCCGACCC TTGTGACCCA CAGTGATGCT 
5 AATAGCCATA TGGGGAGAGG TATATGGAAT 
TCTCCACAGC AGCAGCATCA GAATCATCAC 
CAACAGAATT CTCAATTTTG CTTTGTTAAC 
CAACAAAACC TGGTGTATCC CCCTCAATAC 
GCGTACAGAC GACGTAAATC TAGTCTCGTT 
10 CCTTTCCAGT ACGATAGTTA TCCCGCTTAC 
AG CAGTGGC C AGTATCCTTC TGGCTATCAA 
G CTATC CATC CTTCCCAATT TGGATCCAGA 
TTCCAAAGAA GGCAGAGTCT GGCTGCAACT 
TCAAATACTA ATCAAGGAAC AAACTCCATA 
15 GCATATCCTC CTAGTACAAG TCCCCCACTA 
GAAGTACAAG GTCAGAAATT GTCCATTCCT 
CTTCAACCTG TCTTAAATG C TACTCCAAAG 
ATATCTCCCC TAGTCAGTGT CACGAAAAGC 
GAATTTACCT ACCAAACGTC CAAAAACCCC 

20 AAATGTAACA ACGGATTCGA CAACATAAAC 
TTGGGTGTAG AG C AG AAC AG AAAGTACCTT 
GGTCAAGTGG TCAAATGTCA AAATTTGCTG 
AAATCGAGGA CAGAATATTT GACTCAAAGT 
AATCAAAAGA TAG AC C CTAC TAATAAACAT 

25 CACAAGAACC ACTTATGTTT AGTGTTTGAA 
AAACAAAACA AATTTCATGG GCTCTCTATA 
CTGGATTCGT T ATG CGTTTT GAAGGAAAGC 
AATATCTTGC TCTGCGCACC TGATAAGCCG 
TGCGAAGAGG CAAGAACCGT TTATACATAC 

30 ATTATACTGG GTATACCGTA TTCAACCAGT 
GCTGAATTAT TTTTGGGTAT AC CG ATCTTC 
AGAATAATAG ACACGCTTGG ATATCCTCCA 
GGAAAATTTA TGAAGAAATT GGCACCAGAA 
ATGAAAACTA TTGAAGAGTT TTGCAGAGAA 

35 TATTTTAAGT GGAGAAAGTT ACCAGATATT 
CAGAACTCCC AAGAACTTAT CGACCAAGAA 
TTAGG CGGTG TGCTAAATTT GAACCCGTTA 
CACCCCTTCA TAACAAAGCA GGAGTTTACA 
CCGGGTCCTT CAGAAAAACA TGACGATGCA 

40 AACGACTCTA GTAACAATGC AGGCCACAAC 



- 24- 



AGCTCCAATA GCAACATGAA TAACTCCTTG 
AGTATGGGCT CGGGTAGAGC AAG CCCAGAC 
CCATCGTACG TAAATCAAGG CTCTCAAAGG 
CAGCAGCAGC AGCAGCAGCA ACAACAACAA 
CCTTGGAATG AGGAAAAAGT AACTAATTCT 
GATGACTTAA ACAGTAACGA AAGTCTAGAT 
GTACCTCCAG CCAGGGCACC TGCCCCAAAT 
ACCAGCTCTA ATACGAGTTT GGCAGGAAAT 
CAACAACAAC AGCAAGTATA CCAGCAGGGC 
TTTGTTCCCT CCCTTTATGA TCGTCAAGAT 
AATTATTCGT CCAATTTTTC TTCCCTAAAT 
CCAGTCATGT CACCTTACAG GAGGCTTAGC 
CAACCCCCTT TCAAGCAGTT ACGAAGAGAT 
C AG ATG C AAC TTTGCAACTC TAAAAACGAT 
TTTAGACGTG CGTCGCTCAA TTCCAAGACG 
CTGATTACCA CATATTCTTT ATGTTCTCCA 
AAGAGAGTAC TTACGAAGCC CAGTGAAGGG 
AGTGACTATA TTCTTTATGT AAATGACGTT 
GTGCTAGACA TTTTGGGTCA AGGTACATTT 
ACGAAAGAGA TATTGGCTGT AAAAGTGGTT 
ATAACGGAGG CTAAAATTTT AGAGCTACTG 
CATTTTTTAA GGATGTATGA CTCCTTTGTC 
TTACTAAGCA ATAATTTGTA CGAATTATTA 
CAACTTATCA GAACATTCAC CACTCAAATA 
AAACTGATTC ATTGCGATCT GAAGCCAGAG 
GAATTGAAAA TTATTGATTT TGGCTCATCT 
ATCCAGTCCA GGTTTTATCG TGCTCCTGAA 
ATTGACATGT GGTCGTTAGG TTGTATTGTT 
CCAGGCGCTT CTGAATATAA CCAATTAACA 
TCGTGGATGA TAGATATGGG TAAAAACTCT 
GAAAGTTCTT CTTCTACACA AAAG CATCGT 
TACAATATAG TGGAAAAGCC CAGTAAACAA 
ATTAGAAACT ACAGGTATCC TAAAAGCATA 
ATGCAGAATA GGGAGTGTTT GATCCACTTT 
GAAAGATGGA CACCACAACA AGCTATGCTA 
GGTGAGTGGT TTCCTCCAGG ATCGTCTTTA 
AAAGGCCAGC AAAGTGAATA TGGAAGTGCG 
TATGTCTATA ATCCTAGCTC TGCCACTGGT 
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GGTGCTGATA GCGTCGACAT TGGTGCTATC AGTAAAAGGA AGGAGAATAC ATCTGGCGAC 
ATCTCCAATA ATTTTGCTGT TACTCATTCT GTTCAAGAAG GGCCAACAAG CGCGTTCAAT 
AAAC TTC AC A TTGTCGAAGA ATAA 

SEQ ID NO:2 

5 MNSSNNNDSS SSNSNMNNSL SPTLVTHSDA SMGSGRASPD NSHMGRGIWN PSYVNQGSQR 
SPQQQHQNHH QQQQQQQQQQ QQNSQFCFVN PWNEEKVTNS QQNLVYPPQY DDLNSNESLD 
AYRRRKSSLV VPPARAPAPN PFQYDSYPAY TSSNTSLAGN SSGQYPSGYQ QQQQQVYQQG 
AIHPSQFGSR FVPSLYDRQD FQRRQSLAAT NYSSNFSSLN SNTNQGTNSI PVMSPYRRLS 
AYPPSTSPPL QPPFKQLRRD EVQGQKLSIP QMQLCNSKND LQPVLNATPK FRRASLNSKT 

10 ISPLVSVTKS LITTYSLCSP EFTYQTSKNP KRVLTKPSEG KCNNGFDNIN SDYILYVNDV 
LGVEQNRKYL VLDILGQGTF GQWKCQNLL TKEILAVKW KSRTEYLTQS ITEAKILELL 
NQKIDPTNKH HFLRMYDSFV HKNHLCLVFE LLSNNLYELL KQNKFHGLSI QLIRTFTTQI 
LDSLCVLKES KLIHCDLKPE NILLCAPDKP ELKIIDFGSS CEEARTVYTY IQSRFYRAPE 
IILGIPYSTS IDMWSLGCIV AELFLGIPIF PGASEYNQLT RIIDTLGYPP SWMIDMGKNS 

15 GKFMKKLAPE ESSSSTQKHR MKTIEEFCRE YNIVEKPSKQ YFKWRKLPDI IRNYRYPKSI 
QNSQELIDQE MQNRECLIHF LGGVLNLNPL ERWTPQQAML HPFITKQEFT GEWFPPGSSL 
PGPSEKHDDA KGQQS EYGS A NDSSNNAGHN YVYNPSSATG GADSVDIGAI SKRKENTSGD 
I SNNFAVTHS VQEGPTSAFN KLHIVEE 

SEQ ID NO:3 

GGAAACCTTC GGCCGCCGCT CCCGCCGCCT ACCCGACCGA TTGGCGG C AG TAAGCACACA 
ATGAATGATC ACCTGCATGT CGGCAGCCAC GCT CACGGAC AGATCCAGGT TCAACAGTTG 
TTTGAGGATA ACAGTAACAA GCGGACAGTG CTCACGACAC AACCAAATGG GCTTACAACA 
GTGGGCAAAA CGGGCTTGCC AGTGGTGCCA GAGCGGCAGC TGGACAGCAT TCATAGACGG 
CAGGGGAGCT CCACCTCTCT AAAGTCCATG GAAGGCATGG GGAAGGTGAA AGCCACCCCC 
ATGACACCTG AACAAGCAAT GAAGCAATAC ATGCAAAAAC TCACAGCCTT CGAACACCAT 
GAGATTTTCA GCTACCCTGA AATATATTTC TTGGGTCTAA ATGCTAAGAA GCGCCAGGGC 
ATGACAGGTG GGCCCAACAA TGGTGGCTAT GATGATGACC AGGGATCATA TGTGCAGGTG 
CCCCACGATC ACGTGGCTTA CAGGTATGAG GTCCTCAAGG TCATTGGGAA GGGGAGCTTT 
GGGCAGGTGG TCAAGGCCTA CGATCACAAA GTCCACCAGC ACGTGGCCCT AAAGATGGTG 
CGGAATGAGA AGCGCTTCCA CCGGCAAGCA GCGGAGGAGA TCCGAATCCT GGAACACCTG 
CGGAAGCAGG ACAAGGATAA CACAATGAAT GTCATCCATA TGCTGGAGAA TTTCACCTTC 
CGCAACCACA TCTGCATGAC GTTTGAGCTG CTGAGCATGA ACCTCTATAA GCTCATCAAG 
AAGAATAAAT TCCAGGGCTT CAGTCTGCCT TTGGTTCGCA AGTTTGCCCA CTCGATTCTG 
CAGTGCTTGG ATGCTTTGCA CAAAAACAGA ATAATTCACT GTGACCTTAA GCCCGAGAAC 
ATTTTGTTAA AGCAGCAGGG TAGAAGCGGT ATTAAAGTAA TTGATTTTGG CTCCAGTTGT 
TACGAGCATC AGCGTGTCTA CACGTACATC CAGTCG CGTT TTTACCGGGC TCCAGAAGTG 
ATCCTTGGGG CCAGGTATGG CATGCCCATT GATATGTGGA GCCTGGGCTG CATTTTAGCA 
GAGCTCCTGA CGGGTTACCC CCTCTTGCCT GGGGAAGATG AAGGGGACCA GCTGGCCTGT 
ATGATTGAAC TGTTGGGCAT GCCCTCACAG AAACTGCTGG ATGCATCCAA ACGAGCCAAA 
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AATTTTGTGA GCTCCAAGGG TTATCCCCGT TACTG CACTG TCACGACTCT CTCAGATGGC 
TCTGTGGTCC TAAACGGAGG CCGTTCCCGG AGGGGGAAAC TGAGGGGCCC ACCGGAGAGC 
AGAGAGTGGG GTAACGCGCT GAAGGGGTGT GATGATCCCC TTTTCCTTGA CTTCTTAAAA 
CAGTGTTTAG AGTGGGATCC TGCAGTGCGC ATGACCCCAG GCCAGGCTTT GCGGCACCCC 
TGGCTGAGGA GGCGGTTGCC AAAGCCTCCC ACCGGGGAGA AAACGTCAGT GAAAAGGATA 
ACTGAGAGCA CCGGTGCTAT CACATCTATA TCCAAGTTAC CTCCACCTTC TAGCTCAGCT 
TCCAAACTGA GGACTAATTT GGCGCAGATG ACAGATGCCA ATGGGAATAT TCAGCAGAGG 
ACAGTGTTGC CAAAACTTGT TAGCTGAGCT CACGTCCCCT GATGCTGGTA ACCTGAAAGA 
TACGACATTG CTGAGCCTTA CTGGGTTGAA AAGGAGTAGC TCAGACCTGT TTTTATTTGC 
TCAATAACTC TACTCATTTG TATCTTTTCA GCACTTAATT TTAATGTAAG AAAGTTGTTC 
ATTTTGTTTT TATAAAATAC ATGAGGACAA TGCTTTAAGT TTTTATACTT TCAGAAACTT 
TTTGTGTTCT AAAAGTACAA TGAGCCTTAC TGTATTTAGT GTGGCAGAAT AATAACATCA 
ATGGCAGGCC ACTGATTACT TCATGACTGC CACGCATTTA CAGATTGGTG TCAAAGACAT 
TCACTATGTT TTTATGGTTC ATGTTATATC CTCCCCAGGG TGACAGCCCC TTAAGGCCCT 
CCTTTTCCCT CCATGCTCCA GGTCCATGCA CAGGTGTAGC ATGTC 



SEQ ID NO:4 



Met 


Asn 


Asp 


His 


Leu 


His 


Val 


Gly 


Ser 


His 


Ala 


His 


Gly 


Gin 


He 


Gin 


Val 


Gin 


Gin 


Leu 


Phe 


Glu 


Asp 


Asn 


Ser 


Asn 


Lys 


Arg 


Thr 


Val 


Leu 


Thr 


Thr 


Gin 


Pro 


Asn 


Gly 


Leu 


Thr 


Thr 


Val 


Gly 


Lys 


Thr 


Gly 


Leu 


Pro 


Val 


Val 


Pro 


Glu 


Arg 


Gin 


Leu 


Asp 


Ser 


He 


His 


Arg 


Arg 


Gin 


Gly 


Ser 


Ser 


Thr 


Ser 


Leu 


Lys 


Ser 


Met 


Glu 


Gly 


Met 


Gly 


Lys 


Val 


Lys 


Ala 


Thr 


Pro 


Met 


Thr 


Pro 


Glu 


Gin 


Ala 


Met 


Lys 


Gin 


Tyr 


Met 


Gin 


Lys 


Leu 


Thr 


Ala 


Phe 


Glu 


His 


His 


Glu 


He 


Phe 


Ser 


Tyr 


Pro 


Glu 


He 


Tyr 


Phe 


Leu 


Gly 


Leu 


Asn 


Ala 


Lys 


Lys 


Arg 


Gin 


Gly 


Met 


Thr 


Gly 


Gly 


Pro 


Asn 


Asn 


Gly 


Gly 


Tyr 


Asp 


Asp 


Asp 


Gin 


Gly 


Ser 


Tyr 


Val 


Gin 


Val 


Pro 


His 


Asp 


His 


Val 


Ala 


Tyr 


Arg 


Tyr 


Glu 


Val 


Leu 


Lys 


Val 


He 


Gly 


Lys 


Gly 


Ser 


Phe 


Gly 


Gin 


Val 


Val 


Lys 


Ala 


Tyr 


Asp 


His 


Lys 


Val 


His 


Gin 


His 


Val 


Ala 


Leu 


Lys 


Met 


Val 


Arg 


Asn 


Glu 


Lys 


Arg 


Phe 


His 


Arg 


Gin 


Ala 


Ala 


Glu 


Glu 


He 


Arg 


He 


Leu 


Glu 


His 


Leu 


Arg 


Lys 


Gin 


Asp 


Lys 


Asp 


Asn 


Thr 


Met 


Asn 


Val 


He 


His 


Met 


Leu 


Glu 


Asn 


Phe 


Thr 


Phe 


Arg 


Asn 


His 


He 


Cys 


Met 


Thr 


Phe 


Glu 


Leu 


Leu 


Ser 


Met 


Asn 


Leu 


Tyr 


Lys 


Leu 


He 


Lys 


Lys 


Asn 


Lys 


Phe 


Gin 


Gly 


Phe 


Ser 


Leu 


Pro 


Leu 


Val 


Arg 


Lys 


Phe 


Ala 


His 


Ser 


He 


Leu 


Gin 


Cys 


Leu 


Asp 


Ala 


Leu 


His 


Lys 


Asn 


Arg 


He 


He 


His 


Cys 


Asp 


Leu 


Lys 


Pro 


Glu 


Asn 


He 


Leu 


Leu 


Lys 


Gin 


Gin 


Gly 


Arg 


Ser 


Gly 


He 


Lys 


Val 


He 


Asp 


Phe 


Gly 


Ser 


Ser 


Cys 


Tyr 


Glu 


His 


Gin 


Arg 


Val 


Tyr 


Thr 


Tyr 


He 


Gin 


Ser 


Arg 


Phe 


Tyr 


Arg 


Ala 


Pro 


Glu 


Val 


He 


Leu 


Gly 


Ala 


Arg 


Tyr 


Gly 


Met 


Pro 


He 


Asp 


Met 


Trp 


Ser 


Leu 


Gly 


Cys 


He 


Leu 


Ala 


Glu 


Leu 


Leu 


Thr 


Gly 


Tyr 


Pro 


Leu 


Leu 


Pro 


Gly 


Glu 


Asp 


Glu 


Gly 


Asp 


Gin 


Leu 


Ala 


Cys 


Met 


He 


Glu 


Leu 


Leu 


Gly 


Met 


Pro 


Ser 


Gin 


Lys 


Leu 


Leu 


Asp 


Ala 


Ser 


Lys 


Arg 


Ala 


Lys 


Asn 


Phe 


Val 


Ser 


Ser 


Lys 


Gly 


Tyr 


Pro 


Arg 


Tyr 


Cys 


Thr 


Val 


Thr 


Thr 


Leu 


Ser 


Asp 


Gly 


Ser 


Val 


Val 


Leu 


Asn 


Gly 


Gly 


Arg 


Ser 


Arg 


Arg 


Gly 


Lys 


Leu 


Arg 


Gly 


Pro 


Pro 


Glu 


Ser 


Arg 


Glu 


Trp 


Gly 


Asn 


Ala 


Leu 


Lys 


Gly 


Cys 


Asp 


Asp 
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Pro 


Leu 


Phe 


Leu 


Asp 


Phe 


Leu 


Lys 


Gin 


Cys 


Leu 


Glu 


Trp 


Asp 


Pro 


Ala 


Val 


Arg 


Met 


Thr 


Pro 


Gly 


Gin 


Ala 


Leu 


Arg 


His 


Pro 


Trp 


Leu 


Arg 


Arg 


Arg 


Leu 


Pro 


Lys 


Pro 


Pro 


Thr 


Gly 


Glu 


Lys 


Thr 


Ser 


Val 


Lys 


Arg 


He 


Thr 


Glu 


Ser 


Thr Gly 


Ala 


lie 


Thr 


Ser 


He 


Ser 


Lys 


Leu 


Pro 


Pro 


Pro 


Ser 


Ser 


Ser 


Ala 


Ser 


Lys 


Leu 


Arg 


Thr 


Asn 


Leu 


Ala 


Gin 


Met 


Thr 


Asp 


Ala 


Asn Gly Asn 


He 


Gin 


Gin 


Arg 


Thr 


Val 


Leu 


Pro 


Lys 


Leu 


Val 


Ser 



























SEQ ID NO:5 

GGAGCGAAAT GCGCTGAGCT GCAGTGTCTG GTCGAGAGTA C CCGTGGG AG CGTCGCGCCG 
CGGAGGC AG C CGTCCCGGCG TAGGTGGCGT GGCCGACCGG ACCCCCAACT GGCGCCTCTC 
CCCGCGCGGG GTCCCGAGCT AGGAGATGGG AGGCACAGCT CGTGGGCCTG GGCGGAAGGA 
TGCGGGGCCG CCTGGGGCCG GGCTCCCGCC CCAGCAGCGG AGGTTGGGGG ATGGTGTCTA 
TGACACCTTC ATGATGATAG ATGAAACCAA ATGTCCCCCC TGTTCAAATG TACTCTGCAA 
TCCTTCTGAA CCACCTCCAC CCAGAAGACT AAATATGACC ACTGAGCAGT TTACAGGAGA 
TCATACTCAG CACTTTTTGG ATGGAGGTGA GATGAAGGTA GAACAGCTGT TTCAAGAATT 
TGGCAACAGA AAATCCAATA CTATTCAGTC AGATGGCATC AGTGACTCTG AAAAATGCTC 
TCCTACTGTT TCTCAGGGTA AAAGTTCAGA TTGCTTGAAT ACAGTAAAAT CCAACAGTTC 
ATCCAAGGCA CCCAAAGTGG TGCCTCTGAC TCCAGAACAA GCCCTGAAGC AATATAAACA 
CCACCTCACT GCCTATGAGA AACTGGAAAT AATTAATTAT C CAG AAATTT ACTTTGTAGG 
TCCAAATGCC AAGAAAAGAC ATGGAGTTAT TGGTGGTCCC AATAATGGAG GGTATGATGA 
TGCAGATGGG GCCTATATTC ATGTACCTCG AGACCATCTA GCTTATCGAT ATGAGGTGCT 
GAAAATTATT GGCAAGGGGA GTXTTGGGCA GGTGGCCAGG GTCTATGATC ACAAACTTCG 
ACAGTACGTG GCCCTAAAAA TGGTGCGCAA TGAGAAGCGC TTTCATCGTC AAGCAGCTGA 
GGAGATCCGG ATTTTGGAGC ATCTTAAGAA ACAGGATAAA ACTGGTAGTA TGAACGTTAT 
CCACATGCTG GAAAGTTTCA CATTCCGGAA CCATGTTTGC ATGGCCTTTG AATTGCTGAG 
CATAGACCTT TATGAGCTGA TTAAAAAAAA TAAGTTTCAG GGTTTTAGCG TCCAGTTGGT 
ACGCAAGTTT GCCCAGTCCA TCTTGCAATC TTTGGATGCC CTCCACAAAA ATAAGATTAT 
TCACTGCGAT CTGAAGCCAG AAAACATTCT CCTGAAACAC CACGGGCGCA GTTCAACCAA 
GGTCATTGAC TTTGGGTCCA GCTGTTTCGA GTACCAGAAG CTCTACACATATATCCAGTC 
TCGGTTCTAC AGAGCTCCAG AAATCATCTT AGGAAGCCGC TACAGCACAC CAATTGACAT 
ATGGAGTTTT GGCTGCATCC TTGCAGAACT TTTAACAGGA CAGCCTCTCT TCCCTGGAGA 
GGATGAAGGA GACCAGTTGG CCTGCATGAT GGAGCTTCTA GGGATGCCAC CACCAAAACT 
TCTGGAGCAA TCCAAACGTG CCAAGTACTT TATTAATTCC AAGGGCATAC CCCGCTACTG 
CTCTGTGACT ACCCAGGCAG ATGGGAGGGT TGTGCTTGTG GGGGGTCGCT CACGTAGGGG 
TAAAAAGCGG GGTCCCCCAG GCAGCAAAGA CTGGGGGACA GCACTGAAAG GGTGTGATGA 
CTACTTGTTT ATAGAGTTCT TGAAAAGGTG TCTTCACTGG GACCCCTCTG CCCGCTTGAC 
CCCAGCTCAA GCATTAAGAC AC CCTTGG AT TAGCAAGTCT GTCCCCAGAC CTCTCACCAC 
CATAGACAAG GTGTCAGGGA AACGGGTAGT TAATCCTGCA AGTGCTTTCC AGGGATTGGG 
TTCTAAGCTG CCTCCAGTTG TTGGAATAGC CAATAAGCTT AAAGCTAACT TAATGTCAGA 
AACCAATGGT AGTATACCCC TATGCAGTGT ATTGCCAAAA CTGATTAGCT AGTGGACAGA 
GATATGCCCA GAGATGCATA TGTGTATATT TTTATGATCT TACAAACCTG CAAATGGAAA 
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AAATGCAAGC CCATTGGTGG ATGTTTTTGT TAGAGTAGAC TTTTTTTAAA CAAGACAAAA 
CATTTTTATA TGATTATAAA A 



SEQ ID NO:6 

MGGTARGPGR KDAGPPGAGL PPQQRRLGDG VYDTFMMIDE TKCPPCSNVL CNPSEPPPPR 
5 RLNMTTEQFT GDHTQHFLDG GEMKVEQLFQ EFGNRKSNTI QSDGISDSEK CSPTVSQGKS 
SDCLNTVKSN S S S KAPKWPLTPEQ ALKQ Y KHHLTAYEKL EIINYPEIYF VGPNAKKRHG 
VIGGPNNGGY DDADGAYIHV PRDHLAYRYE VLKIIGKGSF GQVARVYDHK LRQYVALKMV 
RNEKRFHRQA AEEIRILEHL KKQDKTGSMN VIHMLESFTF RNHVCMAFEL LSIDLYELIK 
KNKFQGFSVQ LVRKFAQSIL QS LDALHKNK IIHCDLKPEN ILLKHHGRSS TKVIDFGSSC 
10 FEYQKLYTYI QSRFYRAPEI ILGSRYSTPI DIWSFGCILA ELLTGQPLFP GEDEGDQLAC 
MMELLGMPPP KLLEQSKRAK YF INS KG I PR YCSVTTQADG RWLVGGRSR RGKKRGPPGS 
KDWGTALKGC DDYLFIEFLK RCLHWD PS AR LTPAQALRHP WISKSVPRPL TTIDKVSGKR 
WNPASAFQG LGSKLPPWG IANKLKANLM SETNGSIPLC SVLPKLIS 

SEQ ID NO:7 

CGGCGCTGGC AAGCGAAGCT TGGGGGTGGG GAGGTAGAGT GAGCCCTCAG TAGGAGGGAC 

GAGGGCAGGG GTCTGACTGC CTCCCCGGGA CCGCCCCCAC CTCCTCTCTA TCAGGGCCCC 

CTCCCCCCAT CCCTGTCTCA CCGGGCGCGG GGGACGGGGC TAG AG CGGAG TTAGAGCAAG 

AAGAATTTCC ACCCCTGGAT TCCCTCTGAA ACCCTAGATC GGGGTATATG TTAAGGGATT 

ACGAAAATCT AGGACTTTTT GTGGGGCTTT TTATTAAAGG GGGGGAGCCC GGGAGCAATA 

CCTTGGAAAG AAGCCCTGTT GCTTAGAGCG GATAACCAAC GGCTGAACTC TTGGGGTTTG 

CTGTGAGGGG TGCGGTCTAG CTTCGAATGT ACAGTGGTGG AGCCACAGTG TTAAAGAACA 

GAGAAGTGAT CCTTAATCAT TTAGAATTTT GCCTCCACCA TCCACCAGAA AATGAAGTGG 

AAAGAGAAGT TGGGGGATGG TGTCTATGAC ACCTTCATGA TGATAGATGA AAC CAAATGT 

~CCCCCCTGTT CAAATGTACT" CTGCAATCCT TCTGAACCAC" CTCCACCCAG AAGACTAAAT 

ATGACCACTG AGCAGTTTAC AGGAGATCAT ACTCAGCACT TTTTGGATGG AGGTGAGATG 

AAGGTAGAAC AGCTGTTTCA AGAATTTGGC AACAGAAAAT CCAATACTAT TCAGTCAGAT 

GGCATCAGTG ACTCTGAAAA ATGCTCTCCT ACTGTTTCTC AGGGTAAAAG TTCAGATTGC 

TTGAATACAG TAAAATCCAA CAGTTCATCC AAGGCACCCA AAGTGGTGCC TCTGACTCCA 

GAACAAGCCC TGAAGCAATA TAAACACCAC CTCACTGCCT ATGAGAAACT GGAAATAATT 

AATTATCCAG AAATTTACTT TGTAGGTCCA AATG CCAAGA AAAGACATGG AGTTATTGGT 

GGTCCCAATA ATGGAGGGTA TGATGATGCA GATGGGGCCT ATATTCATGT ACCTCGAGAC 

CATCTAGCTT ATCGATATGA GGTGCTGAAA ATTATTGG C A AGGGGAGTTT TGGGCAGGTG 

GCCAGGGTCT ATGATCACAA ACTTCGACAG TACGTGGCCC TAAAAATGGT GCGCAATGAG 

AAGCGCTTTC ATCGTCAAGC AGCTGAGGAG ATCCGGATTT TGGAGCATCT TAAGAAACAG 

GATAAAACTG GTAGTATGAA CGTTATCCAC ATGCTGGAAA GTTTCACATT CCGGAACCAT 

GTTTGCATGG CCTTTGAATT GCTGAGCATA GACCTTTATG AGCTGATTAA AAAAAATAAG 

TTTCAGGGTT TTAGCGTCCA GTTGGTACGC AAGTTTGCCC AGTCCATCTT GCAATCTTTG 
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GATGCCCTCC ACAAAAATAA GATTATTCAC TGCGATCTGA AGCCAGAAAA CATTCTCCTG 
AAACACCACG GGCGCAGTTC AACCAAGGTC ATTGACTTTG GGTCCAGCTG TTTCGAGTAC 
CAGAAGCTCT ACACATATAT CCAGTCTCGG TTCTACAGAG CTCCAGAAAT CATCTTAGGA 
AGCCGCTACA GCACACCAAT TGACATATGG AGTTTTGGCT GCATCCTTGC AGAACTTTTA 
ACAGGACAGC CTCTCTTCCC TGGAGAGGAT GAAGGAGACC AGTTGGCCTC CATGATGGAG 
CTTCTAGGGA TGCCACCACC AAAACTTCTG GAGCAATCCA AACGTGCCAA GT AC TTTATT 
AATTCCAAGG GCATACCCCG CTACTGCTCT GTGACTACCC AGGCAGATGG GAGGGTTGTG 
CTTGTGGGGG GTCGCTCACG TAGGGGTAAA AAGCGGGGTC CCCCAGGCAG CAAAGACTGG 
GGGACAGCAC TGAAAGGGTG TGATGACTAC TTGTTTATAG AGTTCTTGAA AAGGTGTCTT 
CACTGGGACC CCTCTGCCCG CTTGACCCCA GCTCAAGCAT TAAGACACCC TTGGATTAGC 
AAGTCTGTCC CCAGACCTCT CACCACCATA GACAAGGTGT CAGGGAAACG GGTAGTTAAT 
CCTGCAAGTG CTTTCCAGGG ATTGGGTTCT AAGCTGCCTC CAGTTGTTGG AATAGCCAAT 
AAGCTTAAAG CTAACTTAAT GTCAGAAACC AATGGTAGTA TACCCCTATG CAGTGTATTG 
C C AAAACTG A TTAGCTAGTG GACAGAGATA TGCCCAGAGA TGCATATGTG TATATTTTTA 
TGATCTTACA AACCTG CAAA TGGAAAAAAT GCAAGCC CAT TGGTGGATGT TTTTGTTAGA 
GTAGACTTTT TTTAAACAAG ACAAAACATT TTTATATGAT TATAAAA 

SEQ ID NO:8 

Met Lys Trp Lys Glu Lys Leu Gly Asp Gly Val Tyr Asp Thr Phe Met Met He 
Asp Glu Thr Lys Cys Pro Pro Cys Ser Asn Val Leu Cys Asn Pro Ser Glu Pro 
Pro Pro Pro Arg Arg Leu Asn Met Thr Thr Glu Gin Phe Thr Gly Asp His Thr 
Gin His Phe Leu Asp Gly Gly Glu Met Lys Val Glu Gin Leu Phe Gin Glu Phe 
Gly Asn Arg Lys Ser Asn Thr He Gin Ser Asp Gly He Ser Asp Ser Glu Lys 
Cys Ser Pro Thr Val Ser Gin Gly Lys Ser Ser Asp Cys Leu Asn Thr Val Lys 
Ser Asn Ser Ser Ser Lys Ala Pro Lys Val Val Pro Leu Thr Pro Glu Gin Ala 
Leu Lys Gin Tyr Lys His His Leu Thr Ala Tyr Glu Lys Leu Glu He He Asn 
Tyr Pro Glu He Tyr Phe Val Gly Pro Asn Ala Lys Lys Arg His Gly Val He 
Gly Gly Pro Asn Asn Gly Gly Tyr Asp Asp Ala Asp Gly Ala Tyr He His Val 
Pro Arg Asp His Leu Ala Tyr Arg Tyr Glu Val Leu Lys He He Gly Lys Gly 
Ser Phe Gly Gin Val Ala Arg Val Tyr Asp His Lys Leu Arg Gin Tyr Val Ala 
Leu Lys Met Val Arg Asn Glu Lys Arg Phe His Arg Gin Ala Ala Glu Glu He 
Arg He Leu Glu His Leu Lys Lys Gin Asp Lys Thr Gly Ser Met Asn Val He 
His Met Leu Glu Ser Phe Thr Phe Arg Asn His Val Cys Met Ala Phe Glu Leu 
Leu Ser He Asp Leu Tyr Glu Leu He Lys Lys Asn Lys Phe Gin Gly Phe Ser 
Val Gin Leu Val Arg Lys Phe Ala Gin Ser He Leu Gin Ser Leu Asp Ala Leu 
His Lys Asn Lys He He His Cys Asp Leu Lys Pro Glu Asn He Leu Leu Lys 
His His Gly Arg Ser Ser Thr Lys Val He Asp Phe Gly Ser Ser Cys Phe Glu 
Tyr Gin Lys Leu Tyr Thr Tyr He Gin Ser Arg Phe Tyr Arg Ala Pro Glu He 
He Leu Gly Ser Arg Tyr Ser Thr Pro He Asp He Trp Ser Phe Gly Cys He 
Leu Ala Glu Leu Leu Thr Gly Gin Pro Leu Phe Pro Gly Glu Asp Glu Gly Asp 
Gin Leu Ala Ser Met Met Glu Leu Leu Gly Met Pro Pro Pro Lys Leu Leu Glu 
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Gin Ser Lys Arg Ala Lys Tyr Phe lie 
Ser Val Thr Thr Gin Ala Asp Gly Arg 
Arg Gly Lys Lys Arg Gly Pro Pro Gly 
Gly Cys Asp Asp Tyr Leu Phe lie Glu 
Pro Ser Ala Arg Leu Thr Pro Ala Gin 
Ser Val Pro Arg Pro Leu Thr Thr lie 
Asn Pro Ala Ser Ala Phe Gin Gly Leu 
lie Ala Asn Lys Leu Lys Ala Asn Leu 
Leu Cys Ser Val Leu Pro Lys Leu lie 

SEQ ID NO:9 

Leu Gly Gly Arg Asp Ser Arg Ser Gly 
SEQ ID NO:10 

Leu Gly Gly Arg Asp Ser Arg Ser Gly 
5 SEQEDNO:ll 

Leu Gly Gly Arg Asp Ser Arg Ser Gly 
SEQ ID NO: 12 

Leu Gly Gly Arg Asp Ser Arg Ser Gly 
SEQ ID NO: 13 
10 Leu Gly Gly Arg Asp Ser Arg Ser Pro 
SEQIDNO:14 

Leu Gly Gly Arg Asp Ser Arg Ser Pro 
SEQIDNO:15 

Leu Gly Gly Arg Asp Ser Arg Ala Gly 
15 SEQIDNO:16 

Leu Gly Gly Arg Asp Ala Arg Ser Gly 
SEQ ID NO: 17 



30- 



Asn 


Ser 


Lys 


Gly 


He 


Pro 


Arg 


Tyr 


Cys 


Val 


Val 


Leu 


Val 


Gly 


Gly 


Arg 


Ser 


Arg 


Ser 


Lys 


Asp 


Trp 


Gly 


Thr 


Ala 


Leu 


Lys 


Phe 


Leu 


Lys 


Arg 


Cys 


Leu 


His 


Trp 


Asp 


Ala 


Leu Arg 


His 


Pro 


Trp 


He 


Ser 


Lys 


Asp 


Lys 


Val 


Ser Gly 


Lys 


Arg 


Val 


Val 


Gly 


Ser 


Lys 


Leu 


Pro 


Pro 


Val 


Val 


Gly 


Met 


Ser 


Glu 


Thr Asn Gly 


Ser 


He 


Pro 


Ser 


















Ser 


Pro 


Met 


Ala 


Arg 


Arg 








Ser 


Ala 


Met 


Ala 


Arg 


Arg 


(Peptide 


1) 


Thr 


Pro 


Met 


Ala 


Arg 


Arg 


(Peptide 


2) 


Ser 


Pro 


Pro 


Ala 


Arg 


Arg 


(Peptide 


3) 


Ser 


Pro 


Met 


Ala 


Arg 


Arg 


(Peptide 


4) 


Ser 


Ala 


Met 


Ala 


Arg 


Arg 


(Peptide 


5) 


Ser 


Pro 


Met 


Ala 


Arg 


Arg 


(peptide 


6) 


Ser 


Pro 


Met 


Ala 


Arg 


Arg 


(Peptide 


7) 
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Leu Gly Gly Arg Asp Ala Arg Ala Gly Ser Pro Met Ala Arg Arg (peptide 8) 

SEQ ID NO:18 

NIVTPRTPPPSQGK 
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What is claimed is: 

1 . A method for screening for an agent that modulates a YAK polypeptide kinase activity 
comprising: 

a. contracting a candidate agent with a YAK polypeptide, wherein the step of 

5 contacting is carried out under conditions and for a time sufficient to allow the candidate agent and 
the polypeptide to interact; and 

b. subsequently measuring the ability of the candidate agent to modulate the ability 
of the polypeptide to phosphorylate the substrate. 

2. The method of claim 1 wherein the YAK polypeptide has an amino acid sequence 
10 selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and SEQ ID 

NO:8. 

3. The method of claim 2 wherein the substrate comprises the SER-164 polypeptide. 
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Figure 2 

hYAKl 
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Figure 3 



Steady State Two Substrate Kinetic Analysis 




Kinetic Constants (GraphFit) 



Km [ATP] = 42 ±7 uM K»,[S164] = 160 ±14 uM 
V max = 51 ±6 umol/mg kcat = 1 60 ±19 min-1 
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SEQUENCE LISTING 

<110> SMITHKLINE BEECHAM CORPORATION 
SMITHKLINE BEEHCAM PLC 

<120> SPECIFIC PEPTIDE SUBSTRATE FOR HUMAN 

YAK1, YAK3A, AND YAK3B AND YEAST YAK1 PROTEIN KINASES 



<130> GH-70650WO 

<14 0> TO BE ASSIGNED 
<141> 2000-11-06 

<150> 60/163,901 
<151> 1999-11-05 

<150> TO BE ASSIGNED 

<151> 2000-11-03 

<160> 18 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 2424 

<212> DNA 

<213> HOMO SAPIENS 



<400> 1 

atgaactcat ccaataataa cgactcgtcc agctccaata gcaacatgaa taactccttg 60 

agcccgaccc ttgtgaccca cagtgatgct agtatgggct cgggtagagc aagcccagac 120 

aatagccata tggggagagg tatatggaat ccatcgtacg taaatcaagg ctctcaaagg 180 

tctccacagc agcagcatca gaatcatcac cagcagcagc agcagcagca acaacaacaa 240 

caacagaatt ctcaattttg ctttgttaac ccttggaatg aggaaaaagt aactaattct 300 

caacaaaacc tggtgtatcc ccctcaatac gatgacttaa acagtaacga aagtctagat 360 

gcgtacagac gacgtaaatc tagtctcgtt gtacctccag ccagggcacc tgccccaaat 420 

cctttccagt acgatagtta tcccgcttac accagctcta atacgagttt ggcaggaaat 480 

agcagtggcc agtatccttc tggctatcaa caacaacaac agcaagtata ccagcagggc 54 0 

gctatccatc cttcccaatt tggatccaga tttgttccct ccctttatga tcgtcaagat 600 

ttccaaagaa ggcagagtct ggctgcaact aattattcgt ccaatttttc ttccctaaat 660 

tcaaatacta atcaaggaac aaactccata ccagtcatgt caccttacag gaggcttagc 720 

gcatatcctc ctagtacaag tcccccacta caaccccctt tcaagcagtt acgaagagat 780 

gaagtacaag gtcagaaatt gtccattcct cagatgcaac tttgcaactc taaaaacgat 840 

cttcaacctg tcttaaatgc tactccaaag tttagacgtg cgtcgctcaa ttccaagacg 900 

atatctcccc tagtcagtgt cacgaaaagc ctgattacca catattcttt atgttctcca 960 

gaatttacct accaaacgtc caaaaacccc aagagagtac ttacgaagcc cagtgaaggg 1020 

aaatgtaaca acggattcga caacataaac agtgactata. ttctttatgt aaatgacgtt 1080 

ttgggtgtag agcagaacag aaagtacctt gtgctagaca 'ttttgggtca aggtacattt 1140 

ggtcaagtgg tcaaatgtca aaatttgctg acgaaagaga tattggctgt aaaagtggtt 1200 

aaatcgagga cagaatattt gactcaaagt ataacggagg ctaaaatttt agagctactg 1260 

aatcaaaaga tagaccctac taataaacat cattttttaa ggatgtatga ctcctttgtc 1320 

cacaagaacc acttatgttt agtgtttgaa ttactaagca ataatttgta cgaattatta 1380 
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aaacaaaaca 
ctggattcgt 
aatatcttgc 
tgcgaagagg 
attatactgg 
gctgaattat 
agaataatag 
ggaaaattta 
atgaaaacta 
tattttaagt 
cagaactccc 
ttaggcggtg 
caccccttca 
ccgggtcctt 
aacgactcta 
ggtgctgata 
atctccaata 
aaacttcaca 



aatttcatgg 
tatgcgtttt 
tctgcgcacc 
caagaaccgt 
gtataccgta 
ttttgggtat 
acacgcttgg 
tgaagaaatt 
ttgaagagtt 
ggagaaagtt 
aagaacttat 
tgctaaattt 
taacaaagca 
cagaaaaaca 
gtaacaatgc 
gcgtcgacat 
attttgctgt 
ttgtcgaaga 



gctctctata 
gaaggaaagc 
tgataagccg 
ttatacatac 
ttcaaccagt 
accgatcttc 
atatcctcca 
ggcaccagaa 
ttgcagagaa 
accagatatt 
cgaccaagaa 
gaacccgtta 
ggagtttaca 
tgacgatgca 
aggccacaac 
tggtgctatc 
tactcattct 
ataa 



caacttatca 

aaactgattc 

gaattgaaaa 

atccagtcca 

attgacatgt 

ccaggcgctt 

tcgtggatga 

gaaagttctt 

tacaatatag 

attagaaact 

atgcagaata- 

gaaagatgga 

ggtgagtggt 

aaaggccagc 

tatgtctata 

agtaaaagga 

gttcaagaag 



gaacattcac 
attgcgatct 
ttattgattt 
ggttttatcg 
ggtcgttagg 
ctgaatataa 
tagatatggg 
cttctacaca 
tggaaaagcc 
acaggtatcc 
;gggagtgttt 
caccacaaca 
ttcctccagg 
aaagtgaata 
atcctagctc 
aggagaatac 
ggccaacaag 



cactcaaata 
gaagccagag 
tggctcatct 
tgctcctgaa 
ttgtattgtt 
ccaattaaca 
taaaaactct 
aaagcatcgt 
cagtaaacaa 
taaaagcata 
gatccacttt 
agctatgcta 
atcgtcttta 
tggaagtgcg 
tgccactggt 
atctggcgac 
cgcgttcaat 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2424 



<210> 2 
<211> 807 
<212> PRT 

<213> HOMO SAPIENS 



<400> 
Met Asn Ser 
1 

Asn Asn Ser 

Gly Ser Gly 
35 

Trp Asn Pro 
50 

Gin His Gin 
65 

Gin Gin Asn 

Val Thr Asn 

Leu Asn Ser 
115 

Leu Val Val 

130 
Asp Ser Tyr 
145 

Ser Ser Gly 

Tyr Gin Gin 

Pro Ser Leu 
195 

Ala Thr Asn 

210 
Gin Gly Thr 



Ser Asn 
5 

Leu Ser 
20 

Arg Ala 

Ser Tyr 

Asn His 

Ser Gin 
85 

Ser Gin 
100 

Asn Glu 

Pro Pro 

Pro Ala 

Gin Tyr 
165 
Gly Ala 
180 

Tyr Asp 
Tyr Ser 
Asn Ser 



Asn Asn Asp 

Pro Thr Leu 

Ser Pro Asp 
40 

Val Asn Gin 
55 

His Gin Gin 
70 

Phe Cys Phe 

Gin Asn Leu 

Ser Leu Asp 
120 

Ala Arg Ala 
135 

Tyr Thr Ser 
150 

Pro Ser Gly 

lie His Pro 

Arg Gin Asp 
200 

Ser Asn Phe 

215 
He Pro Val 



Ser Ser Ser Ser Asn Ser Asn Met 

10 15 
Val Thr His Ser Asp Ala Ser Met 
25 30 
Asn Ser His Met Gly Arg Gly He 
45 

Gly Ser Gin Arg Ser Pro Gin Gin 
60 

Gin Gin Gin Gin Gin Gin Gin Gin 

7 5 80 
Val Asn Pro Trp Asn Glu Glu Lys 

90 95 
Val Tyr Pro Pro Gin Tyr Asp Asp 
105 "* HO 
Ala Tyr Arg Arg Arg Lys Ser Ser 
125 

Pro Ala Pro Asn Pro Phe Gin Tyr 
140 

Ser Asn Thr Ser Leu Ala Gly Asn 
155 160 
Tyr Gin Gin Gin Gin Gin Gin Val 

170 175 
Ser Gin Phe Gly Ser Arg Phe Val 
185 190 
Phe Gin Arg Arg Gin Ser Leu Ala 
205 

Ser Ser Leu Asn Ser Asn Thr Asn 
220 

Met Ser Pro Tyr Arg Arg Leu Ser 
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225 










230 


Ala 


Tyr 


Pro 


Pro 


Ser 


Thr 










245 




Leu 


Arg 


Arg 


Asp 


Glu 


Val 








260 






Gin 


Leu 


Cys 


Asn 


Ser 


Lys 






275 








Pro 


Lys 


Phe 


Arg 


Arg 


Ala 




290 










Val 


Ser 


Val 


Thr 


Lys 


Ser 


305 










310 


Glu 


Phe 


Thr 


Tyr 


Gin 


Thr 










325 




Pro 


Ser 


Glu 


Gly 


Lys 


Cys 








340 






Tyr 


He 


Leu 


Tyr 


Val 


Asn 






355 








Tyr 


Leu 


Val 


Leu 


Asp 


He 




370 










Lys 


Cys 


Gin 


Asn 


Leu 


Leu 


385 










390 


Lys 


Ser 


Arg 


Thr 


Glu 


Tyr 










405 




Leu 


Glu 


Leu 


Leu 


Asn 


Gin 








420 






Leu 


Arg 


Met 


Tyr 


Asp 


Ser 






435 








Phe 


Glu 


Leu 


Leu 


Ser 


Asn 




450 










Phe 


His 


Gly 


Leu 


Ser 


He 


465 










470 


Leu 


Asp 


Ser 


Leu 


Cys 


Val 










485 




Leu 


Lys 


Pro 


Glu 


Asn 


He 








500 






Lys 


He 


He 


Asp 


Phe 


Gly 






515 








Thr 


Tyr 


He 


Gin 


Ser 


Arg 




530 










He 


Pro 


Tyr 


Ser 


Thr 


Ser 


545 










550 


Ala 


Glu 


Leu 


Phe 


Leu 


Gly 










565 




Asn 


Gin 


Leu 


Thr 


Arg 


He 








580 






Met 


He 


Asp 


Met 


Gly 


Lys 






595 








Pro 


Glu 


Glu 


Ser 


Ser 


Ser 




610 










Glu 


Glu 


Phe 


Cys 


Arg 


Glu 


625 










630 


Tyr 


Phe 


Lys 


Trp 


Arg 


Lys 










645 




Pro 


Lys 


Ser 


He 


Gin 


Asn 



3 











235 




Ser 


Pro 


Pro 


Leu 


Gin 


Pro 








250 






Gin 


Gly 


Gin 


Lys 


Leu 


Ser 






265 








Asn 


Asp 


Leu 


Gin 


Pro 


Val 




280 










Ser 


Leu 


Asn 


Ser 


Lys 


Thr 


295 










30.0 


Leu 


He 


Thr 


Thr 


Tyr 


Ser 










315 




Ser 


Lys 


Asn 


Pro 


Lys 


Arg 








330 






Asn 


Asn 


Gly 


Phe 


Asp 


Asn 






345 








Asp 


Val 


Leu 


Gly 


Val 


Glu 




360 










Leu 


Gly 


Gin 


Gly 


Thr 


Phe 


375 










380 


Thr 


Lys 


Glu 


He 


Leu 


Ala 










395 




Leu 


Thr 


Gin 


Ser 


He 


Thr 








410 






Lys 


He 


Asp 


Pro 


Thr 


Asn 






425 








Phe 


Val 


His 


Lys 


Asn 


His 




440 










Asn 


Leu 


Tyr 


Glu 


Leu 


Leu 


455 










460 


Gin 


Leu 


He 


Arg 


Thr 


Phe 










475 




Leu 


Lys 


Glu 


Ser 


Lys 


Leu 








490 






Leu 


Leu 


Cys 


Ala 


Pro 


Asp 






505 








Ser 


Ser 


Cys 


Glu 


Glu 


Ala 




520 










Phe 


Tyr 


Arg 


Ala 


Pro 


Glu 


535 










540 


He 


Asp 


Met 


Trp 


Ser 


, Leu 










555 


• 


He 


Pro 


He 


Phe 


Pro 


Gly 








570 






He 


Asp 


Thr 


Leu 


Gly 


Tyr 






585 








Asn 


Ser 


Gly 


Lys 


Phe 


Met 




600 










Ser 


Thr 


Gin 


Lys 


His 


Arg 


615 










620 


Tyr 


Asn 


He 


Val 


Glu 


Lys 










635 




Leu 


Pro 


Asp 


He 


He 


Arg 








650 






Ser 


Gin 


Glu 


Leu 


He 


Asp 



PCT/USOO/30542 



240 



Pro 


Phe 


Lys 


Gin 






255 




He 


Pro 


Gin 


Met 




270 






Leu 


Asn 


Ala 


Thr 


o o zz. 

28 5 








He 


Ser 


Pro 


Leu 


Leu 


Cys 


Ser 


Pro 








320 


Val 


Leu 


Thr 


Lys 






335 




He 


Asn 


Ser 


Asp 




350 






Gin 


Asn 


Arg 


Lys 


o c c 

365 








Gly 


Gin 


Val 


Val 


Val 


Lys 


Val 


Val 








400 


Glu 


Ala 


Lys 


He 






415 




Lys 


His 


His 


Phe 




430 






Leu 


Cys 


Leu 


Val 


A A C 

4 4 D 








Lys 


Gin 


Asn 


Lys 


Thr 


Thr 


Gin 


He 








480 


He 


His 


Cys 


Asp 






495 




Lys 


Pro 


Glu 


Leu 




510 






Arg 


Thr 


Val 


Tyr 










He 


He 


Leu 


Gly 


Gly 


Cys 


lie 


Val 








560 


Ala 


Ser 


Glu 


Tyr 






575 




Pro 


Pro 


Ser 


Trp 




590 






Lys 


Lys 


Leu 


Ala 


605 








Met 


Lys 


Thr 


He 


Pro 


Ser 


Lys 


Gin 








640 


Asn 


Tyr 


Arg 


Tyr 






655 




Gin 


Glu 


Met 


Gin 
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660 










665 










670 






Asn 


Ar g 






4-1 ts u 


He 


His 


Phe 


Leu 


Gly 


Gly Val 


Leu 


Asn 


Leu 


Asn 




675 










680 










685 








Pro 


Leu 


Ol u 




lip 


Thr 


Pro 


Gin 


Gin 


Ala 


Met 


Leu 


His 


Pro 


Phe 


He 














695 










700 










Th r 


Lys 


Gin 


Glu 


Phe 


Thr 


Glv 


Glu 


Trp 


Phe 


Pro 


Pro 


Gly 


Ser 


Ser 


Leu 


705 








710 










715 










720 


Pro 


Gly 


Pro 


Ser 


Glu 


Lys 


His 


Asp 


Asp 


Ala 


Lys 


Gly 


Gin 


Gin 


Ser 


Glu 








725 










730 










735 




Tyr 


Gly 


Ser 


Ala 


Asn 


Asp 


Ser 


Ser 


Asn 


Asn 


Ala 


Gly 


His 


Asn 


Tyr 


Val 




740 










745 










750 






Tyr 


Asn 


Pro 


Ser 


Ser 


Ala 


Thr 


Gly 


Gly Ala Asp 


Ser 


Val 


Asp 


He 


Gly 




755 










760 










765 








Ala 


He 


Ser 


Lys 


Arg 


Lys 


Glu 


Asn 


Thr 


Ser 


Gly 


Asp 


He 


Ser 


Asn 


Asn 




770 










775 










780 










Phe 


Ala 


Val 


Thr 


His 


Ser 


Val 


Gin 


Glu 


Gly 


Pro 


Thr 


Ser 


Ala 


Phe 


Asn 


785 










790 










795 










800 


Lys 


Leu 


His 


He 


Val 


Glu 


Glu 





















805 



<210> 3 

<211> 2085 

<212> DNA 

<213> HOMO SAPIENS 



<400> 
ggaaacctt c 
atgaatgatc 
tttgaggata 
gtgggcaaaa 
caggggagct 
atgacacctg 
gagattttca 
atgacaggtg 
ccccacgatc 
gggcaggtgg 
cggaatgaga 
cggaagcagg 
cgcaaccaca 
aagaataaat 
cagtgcttgg 
attttgttaa 
tacgagcatc 
atccttgggg 
gagctcctga 
atgattgaac 
aattttgtga 
tctgtggtcc 
agagagtggg 
cagtgtttag 
tggctgagga 
actgagagca 
tccaaactga 
acagtgttgc 



ggccgccgct 
acctgcatgt 
acagtaacaa 
cgggcttgcc 
ccacctctct 
aacaagcaat 
gctaccctga 
ggcccaacaa 
acgtggctta 
tcaaggccta 
agcgcttcca 
acaaggataa 
tctgcatgac 
tccagggctt 
atgctttgca 
agcagcaggg 
agcgtgtcta 
ccaggtatgg 
cgggttaccc 
tgttgggcat 
gctccaaggg 
taaacggagg 
gtaacgcgct 
agtgggatcc 
ggcggttgcc 
ccggtgctat 
ggactaattt 
caaaacttgt 



cccgccgcct 
cggcagccac 
gcggacagtg 
agtggtgcca 
aaagtccatg 
gaagcaatac 
aatatatttc 
tggtggctat 
caggtatgag 
cgatcacaaa 
ccggcaagca 
cacaatgaat 
gtttgagctg 
cagtctg.cct 
caaaaacaga 
tagaagcggt 
cacgtacatc 
catgcccatt 
cctcttgcct 
gccctcacag 
ttatccccgt 
ccgttcccgg 
gaaggggtgt 
tgcagtgcgc 
aaagcctccc 
cacatctata 
ggcgcagatg 
tagctgagct 



acccgaccga 
gctcacggac 
ctcacgacac 
gagcggcagc 
gaaggcatgg 
atgcaaaaac 
ttgggtctaa 
gatgatgacc 
gtcctcaagg 
gtccaccagc 
gcggaggaga 
gtcatccata 
ctgagcatga 
ttggttcgca 
ataattcact 
attaaagtaa 
cagtcgcgtt 
gatatgtgga 
ggggaagatg 
aaactgctgg 
tactgcactg 
agggggaaac 
gatgatcccc 
atgaccccag 
accggggaga 
tccaagttac 
acagatgcca 
cacgtcccct 



ttggcggcag 
agatccaggt 
aaccaaatgg 
tggacagcat 
ggaaggtgaa 
tcacagcctt 
atgctaagaa 
agggatcata 
tcattgggaa 
acgtggccct 
tccgaatcct 
tgctggagaa 
acctctataa 
agtttgccca 
gtgaccttaa 
ttgattttgg 
tttaccgggc 
gcctgggctg 
aaggggacca 
;atgcatccaa 
tcacgactct 
tgaggggccc 
ttttccttga 
gccaggcttt 
aaacgtcagt 
ctccaccttc 
atgggaatat 
gatgctggta 



taagcacaca 
tcaacagttg 
gcttacaaca 
tcatagacgg 
agccaccccc 
cgaacaccat 
gcgccagggc 
tgtgcaggtg 
ggggagcttt 
aaagatggtg 
ggaacacctg 
tttcaccttc 
gctcatcaag 
ctcgattctg 
gcccgagaac 
ctccagttgt 
tccagaagtg 
cattttagca 
gctggcctgt 
acgagccaaa 
ctcagatggc 
accggagagc 
cttcttaaaa 
gcggcacccc 
gaaaaggata 
tagctcagct 
tcagcagagg 
acctgaaaga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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tacgacattg ctgagcctta ctgggttgaa aaggagtagc tcagacctgt ttttatttgc 1740 

tcaataactc tactcatttg tatcttttca gcacttaatt ttaatgtaag aaagttgttc 1800 

attttgtttt tataaaatac atgaggacaa tgctttaagt-ttttatactt tcagaaactt 1860 

tttgtgttct aaaagtacaa tgagccttac tgtatttagt gtggcagaat aataacatca 1920 

atggcaggcc actgattact tcatgactgc cacgcattta cagattggtg tcaaagacat 1980 

tcactatgtt tttatggttc atgttatatc ctccccaggg tgacagcccc ttaaggccct 2040 

ccttttccct ccatgctcca ggtccatgca caggtgtagc atgtc 2085 

<210> 4 
<211> 528 
<212> PRT 

<213> HOMO SAPIENS 
<400> 4 

Met Asn Asp His Leu His Val Gly Ser His Ala His Gly Gin lie Gin 

1 5 10 15 

Val Gin Gin Leu Phe Glu Asp Asn Ser Asn Lys Arg Thr Val Leu Thr 

20 25 " 30 

Thr Gin Pro Asn Gly Leu Thr Thr Val Gly Lys Thr Gly Leu Pro Val 

35 40 45 

Val Pro Glu Arg Gin Leu Asp Ser lie His Arg Arg Gin Gly Ser Ser 

50 55 60 

Thr Ser Leu Lys Ser Met Glu Gly Met Gly Lys Val Lys Ala Thr Pro 
65 70 75 80 

Met Thr Pro Glu Gin Ala Met Lys Gin Tyr Met Gin Lys Leu Thr Ala 

85 90 95 

Phe Glu His His Glu lie Phe Ser Tyr Pro Glu"**. lie Tyr Phe Leu Gly 

100 105 110 

Leu Asn Ala Lys Lys Arg Gin Gly Met Thr Gly Gly Pro Asn Asn Gly 

115 120 125 

Gly Tyr Asp Asp Asp Gin Gly Ser Tyr Val Gin Val Pro His Asp His 

130 135 140 

Val Ala Tyr Arg Tyr Glu Val Leu Lys Val lie Gly Lys Gly Ser Phe 
145 150 155 ^ 160 

Gly Gin Val Val Lys Ala Tyr Asp His Lys Val His Gin His Val Ala 

165 170 175 

Leu Lys Met Val Arg Asn Glu Lys Arg Phe His Arg Gin Ala Ala Glu 

180 185 ~ 190 

Glu lie Arg lie Leu Glu His Leu Arg Lys Gin Asp Lys Asp Asn Thr 

195 200 205 

Met Asn Val lie His Met Leu Glu Asn Phe Thr Phe Arg Asn His lie 

210 215 220 

Cys Met Thr Phe Glu Leu Leu Ser Met Asn Leu Tyr Lys Leu lie Lys 
225 230 235 240 

Lys Asn Lys Phe Gin Gly Phe Ser Leu Pro Leu Val Arg Lys Phe Ala 

245 250 " 255 

His Ser lie Leu Gin Cys Leu Asp Ala Leu His Lys Asn Arg lie lie 

260 265 270 

His Cys Asp Leu Lys Pro Glu Asn lie Leu Leu Lys Gin Gin Gly Arg 

2 75 280 , 285 

Ser Gly lie Lys Val lie Asp Phe Gly Ser Ser *Cys Tyr Glu His Gin 

290 295 300 

Arg Val Tyr Thr Tyr lie Gin Ser Arg Phe Tyr Arg Ala Pro Glu Val 
305 310 315 320 
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lie 


Leu 


Gly 


Ala 


Arg 


Tyr 


oiy 






Tip 

_L JL t: 


Asp 


Met 


T rn 
up 


Ser 


Leu 


Gly 


















j> ~j \j 










335 




Cys 


lie 


Leu 


ax a 


(jXU 


Leu 


Leu 


1 11 1 


oiy 


T 1 \T 7~ 

i yr 


Pro 


Leu 


Leu 


Pro 


Gly 


Glu 






■a/I n 




















350 






Asp 


Glu 


Gly 


Asp 


bin 


Leu 


ZA.-L a 


Cys 


LYie L. 


Tip 
11c 


Glu 


Leu 


Xieu 


Gly 


Mpf 
L 1C L. 


IT .L KJ 




JJ D 










JUU 










365 








Ser 


Gin 


Lys 


Leu 


Leu 


Asp 


ax a 


Qo r- 

JCl 


Lys 


A v rr 

mi y 


Ala 


Lys 


/"lO 1 1 


Phe 


Val 


Ser 




"3 "7 O 








o / o 










380 










Ser 


Lys 


Gly 


Tyr 


Pro 


Arg 


Tyr 


Cys 


1 111 


val 


Thr-.Thr 


- 

LSU 


Ser 


Asp 


Gly 


385 








*5 on 










395 










4 00 


Ser 


Val 


Val 


Leu 


Asn 


Gly 


uxy 


Arg 


Q Q y- 


Arg 


Arg 


Gly 


Lys 


lit- Li 


Ml y 


uiy 










4 Ud 










/in 










415 




Pro 


Pro 


Glu 


Ser 


Arg 


CaXU 


Trp 


bxy 


Asn 


Ala 


Leu 


Lys 


uiy 


o _y ki> 


no jkJ 


7\ er> 
nop 








420 










425 










430 






Pro 


Leu 


Phe 


Leu 


Asp 


Phe 


Leu 


Lys 


Gin 


Cys 


Leu 


Glu 


Trp 


Asp 


Pro 


Ala 






435 










440 










445 








Val 


Arg 


Met 


Thr 


Pro 


Gly 


Gin 


Ala 


Leu 


Arg 


His 


Pro 


Trp 


Leu 


Arg 


Arg 




450 










455 










460 










Arg 


Leu 


Pro 


Lys 


Pro 


Pro 


Thr 


Gly 


Glu 


Lys 


Thr 


Ser 


Val 


Lys 


Arg 


He 


465 










470 










475 










480 


Thr 


Glu 


Ser 


Thr 


Gly 


Ala 


He 


Thr 


Ser 


He 


Ser 


Lys 


Leu 


Pro 


Pro 


Pro 










485 










490 










495 




Ser 


Ser 


Ser 


Ala 


Ser 


Lys 


Leu 


Arg 


Thr 


Asn 


Leu 


Ala 


Gin 


Met 


Thr 


Asp 








500 










505 










510 






Ala 


Asn 


Gly 


Asn 


He 


Gin 


Gin 


Arg 


Thr 


Val 


Leu 


Pro 


Lys 


Leu 


Val 


Ser 



515 520 525 

<210> 5 

<211> 2061 

<212> DNA 

<213> HOMO SAPIENS 



<400> 5 '• 

ggagcgaaat gcgctgagct gcagtgtctg gtcgagagta cccgtgggag cgtcgcgccg 60 

cggaggcagc cgtcccggcg taggtggcgt ggccgaccgg acccccaact ggcgcctctc 120 

cccgcgcggg gtcccgagct aggagatggg aggcacagct cgtgggcctg ggcggaagga 180 

tgcggggccg cctggggccg ggctcccgcc ccagcagcgg aggttggggg atggtgtcta 240 

tgacaccttc atgatgatag atgaaaccaa atgtcccccc tgttcaaatg tactctgcaa 300 

tccttctgaa ccacctccac ccagaagact aaatatgacc actgagcagt ttacaggaga 360 

tcatactcag cactttttgg atggaggtga gatgaaggta gaacagctgt ttcaagaatt 420 

tggcaacaga aaatccaata ctattcagtc agatggcatc agtgactctg aaaaatgctc 480 

tcctactgtt tctcagggta aaagttcaga ttgcttgaat acagtaaaat ccaacagttc 540 

atccaaggca cccaaagtgg tgcctctgac tccagaacaa gccctgaagc aatataaaca 600 

ccacctcact gcctatgaga aactggaaat aattaattat ccagaaattt actttgtagg 660 

tccaaatgcc aagaaaagac atggagttat tggtggtccc aataatggag ggtatgatga 720 

tgcagatggg gcctatattc atgtacctcg agaccatcta gcttatcgat atgaggtgct 780 

gaaaattatt ggcaagggga gttttgggca ggtggccagg gtctatgatc acaaacttcg 840 

acagtacgtg gccctaaaaa tggtgcgcaa tgagaagcgc tttcatcgtc aagcagctga 900 

ggagatccgg attttggagc atcttaagaa acaggataaa actggtagta tgaacgttat 960 

ccacatgctg gaaagtttca cattccggaa ccatgtttgc atggcctttg aattgctgag 1020 

catagacctt tatgagctga ttaaaaaaaa taagtttcag ggttttagcg tccagttggt 1080 

acgcaagttt gcccagtcca tcttgcaatc tttggatgcc ctccacaaaa ataagattat 114 0 

tcactgcgat ctgaagccag aaaacattct cctgaaacac cacgggcgca gttcaaccaa 1200 

ggtcattgac tttgggtcca gctgtttcga gtaccagaag ctctacacat atatccagtc 1260 



WO 01/32918 



PCTAJS00/30542 



7 



tcggttctac 
atggagtttt 
ggatgaagga 
tctggagcaa 
ctctgtgact 
taaaaagcgg 
ctacttgttt 
cccagctcaa 
catagacaag 
ttctaagctg 
aaccaatggt 
gatatgccca 
aaatgcaagc 
catttttata 



agagctccag 
ggctgcatcc 
gaccagttgg 
tccaaacgtg 
acccaggcag 
ggtcccccag 
atagagttct 
gcattaagac 
gtgtcaggga 
cctccagttg 
agtatacccc 
gagatgcata 
ccattggtgg 
tgattataaa 



aaatcatctt 
ttgcagaact 
cctgcatgat 
ccaagtactt 
atgggagggt 
gcagcaaaga 
tgaaaaggtg 
acccttggat 
aacgggtagt 
ttggaatagc 
tatgcagtgt 
tgtgtatatt 
atgtttttgt 
a 



aggaagccgc 
tttaacagga 
ggagcttcta 
tattaattcc 
tgtgcttgtg 
ctgggggaca 
tcttcactgg 
tagcaagtct 
taatcctgca 
caataagctt 
attgccaaaa 
tttatgatct 
tagagtagac 



tacagcacac 
cagcctctct 
gggatgccac 
aagggcatac 
gggggtcgct 
gcactgaaag 
gacccctctg 
gtccccagac 
agtgctttcc 
aaagctaact 
ctgattagct 
tacaaacctg 
tttttttaaa 



caattgacat 
tccctggaga 
caccaaaact 
cccgctactg 
cacgtagggg 
ggtgtgatga 
cccgcttgac 
ctctcaccac 
agggattggg 
taatgtcaga 
agtggacaga 
caaatggaaa 
caagacaaaa 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2061 



<210> 6 
<211> 588 
<212> PRT 

<213> HOMO SAPIENS 





<400> 


6 










Met 


Gly 


Gly 


Thr 


Ala 


Arg 


Gly 


Pro 


1 








5 








Gly 


Ala 


Gly 


Leu 


Pro 


Pro 


Gin 


Gin 








20 










Asp 


Thr 


Phe 


Met 


Met 


He 


Asp 


Glu 






35 










40 


Val 


Leu 


Cys 


Asn 


Pro 


Ser 


Glu 


Pro 




50 










55 




Thr 


Thr 


Glu 


Gin 


Phe 


Thr 


Gly 


Asp 


65 










70 






Gly 


Glu 


Met 


Lys 


Val 


Glu 


Gin 


Leu 










85 








Ser 


Asn 


Thr 


He 


Gin 


Ser 


Asp 


Gly 








100 










Pro 


Thr 


Val 


Ser 


Gin 


Gly 


Lys 


Ser 






115 










120 


Ser 


Asn 


Ser 


Ser 


Ser 


Lys 


Ala 


Pro 




130 










135 




Gin 


Ala 


Leu 


Lys 


Gin 


Tyr 


Lys 


His 


145 










150 






Glu 


He 


He 


Asn 


Tyr 


Pro 


Glu 


He 










165 








Lys 


Arg 


His 


Gly 


Val 


He 


Gly 


Gly 








180 










Ala 


Asp 


Gly Ala 


Tyr 


He 


His 


Val 






195 










200 


Tyr 


Glu 


Val 


Leu 


Lys 


He 


He 


Gly 




210 










215 




Arg 


Val 


Tyr 


Asp 


His 


Lys 


Leu 


Arg 


225 










230 






Arg 


Asn 


Glu 


Lys 


Arg 


Phe 


His 


Arg 










245 








Leu 


Glu 


His 


Leu 


Lys 


Lys 


Gin 


Asp 



Gly 


Arg 


Lys 


Asp 


Ala 


Gly 


Pro 


Pro 




10 










15 




Arg 


Arg 


Leu 


Gly 


Asp 


Gly Val 


Tyr 


25 










30 






Thr 


Lys 


Cys 


Pro 


Pro 


Cys 


Ser 


Asn 










45 








Pro 


Pro 


Pro 


Arg 


Arg 


Leu 


Asn 


Met 








60 










His 


Thr 


Gin 


His 


Phe 


Leu 


Asp 


Gly 






75 










80 


Phe 


Gin 


Glu 


Phe 


Gly Asn Arg 


Lys 




90 










95 




He 


Ser 


Asp 


Ser 


Glu 


Lys 


Cys 


Ser 


105 










110 






Ser 


Asp 


Cys 


Leu 


Asn 


Thr 


Val 


Lys 










125 








Lys 


Val 


Val 


. Pro 


Leu 


Thr 


Pro 


Glu 








-140 










His 


Leu 


Thr 


Ala 


Tyr 


Glu 


Lys 


Leu 






155 










160 


Tyr 


Phe 


Val 


Gly 


Pro 


Asn 


Ala 


Lys 




170 










175 




Pro 


Asn 


Asn 


Gly 


Gly 


Tyr 


Asp 


Asp 


185 










190 






Pro 


Arg 


Asp 


His 


Leu 


Ala 


Tyr 


Arg 










205 








Lys 


Gly 


Ser 


Phe 


Gly 


Gin 


Val 


Ala 








220 










Gin 


Tyr 


Val 


Ala 


Leu 


Lys 


Met 


Val 






235 










240 


Gin 


Ala 


Ala 


Glu 


Glu 


He 


Arg 


He 




250 










255 




Lys 


Thr 


Gly 


Ser 


Met 


Asn 


Val 


He 



WO 01/32918 



PCT/USOO/30542 



8 



260 

His Met Leu Glu 
275 

Glu Leu Leu Ser 
290 

Gin Gly Phe Ser 
305 

Gin Ser Leu Asp 

Lys Pro Glu Asn 
340 

Val lie Asp Phe 
355 

Tyr lie Gin Ser 
370 

Arg Tyr Ser Thr 
385 

Glu Leu Leu Thr 

Gin Leu Ala Cys 
420 

Leu Glu Gin Ser 
435 

Pro Arg Tyr Cys 
450 

Val Gly Gly Arg 
465 

Lys Asp Trp Gly 

Glu Phe Leu Lys 
500 

Pro Ala Gin Ala 
515 

Pro Leu Thr Thr 
530 

Ala Ser Ala Phe 
545 

lie Ala Asn Lys 

lie Pro Leu Cys 
580 



Ser Phe Thr Phe 
280 

lie Asp Leu Tyr 
295 

Val Gin Leu Val 
310 

Ala Leu His Lys 
325 

lie Leu Leu Lys 

Gly Ser Ser Cys 
360 

Arg Phe Tyr Arg 
375 

Pro lie Asp lie 
390 

Gly Gin Pro Leu 
405 

Met Met Glu Leu 

Lys Arg Ala Lys 
440 

Ser Val Thr Thr 
455 

Ser Arg Arg Gly 
470 

Thr Ala Leu Lys 
485 

Arg Cys Leu His 

Leu Arg His Pro 
520 

lie Asp Lys Val 
535 

Gin Gly Leu Gly 
550 

Leu Lys Ala Asn 
565 

Ser Val Leu Pro 



265 

Arg Asn His Val 

Glu Leu lie Lys 
300 

Arg Lys Phe Ala 
315 

Asn Lys lie He 
330 

His His Gly Arg 
345 

Phe Glu Tyr Gin 

Ala Pro Glu He 
380 

Trp Ser Phe Gly 
395 

Phe Pro Gly Glu 
410 

Leu Gly Met Pro 
425 

Tyr Phe He Asn 

Gin Ala Asp Gly 
460 

Lys Lys Arg Gly 
475 

Gly Cys Asp Asp 
490 

Trp Asp Pro Ser 
505 

Trp He Ser Lys 

Ser Gly Lys Arg 
540 

Ser Lys Leu Pro 
555 

Leu Met Ser Glu 
570 

Lys Leu He Ser 
585 



270 

Cys Met Ala Phe 
285 

Lys Asn Lys Phe 

Gin Ser He Leu 
320 

His Cys Asp Leu 
335 

Ser Ser Thr Lys 
350 

Lys Leu Tyr Thr 
365 

He Leu Gly Ser 

Cys He Leu Ala 
400 

Asp Glu Gly Asp 
415 

Pro Pro Lys Leu 
430 

Ser Lys Gly He 
445 

Arg Val Val Leu 

Pro Pro Gly Ser 
480 

Tyr Leu Phe He 
495 

Ala Arg Leu Thr 
510 

Ser Val Pro Arg 
525 

Val Val Asn Pro 

Pro Val Val Gly 
560 

Thr Asn Gly Ser 
575 



<210> 7 

<211> 2327 

<212> DNA 

<213> HOMO SAPIENS 



<400> 7 

cggcgctggc aagcgaagct tgggggtggg gaggtagagt gagccctcag taggagggac 

gagggcaggg gtctgactgc ctccccggga ccgcccccac ctcctctcta tcagggcccc 

ctccccccat ccctgtctca ccgggcgcgg gggacggggc tagagcggag ttagagcaag 

aagaatttcc acccctggat tccctctgaa accctagatc ggggtatatg ttaagggatt 

acgaaaatct aggacttttt gtggggcttt ttattaaagg gggggagccc gggagcaata 

ccttggaaag aagccctgtt gcttagagcg gataaccaac ggctgaactc ttggggtttg 
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ctgtgagggg tgcggtctag cttcgaatgt acagtggtgg agccacagtg ttaaagaaca 420 

gagaagtgat ccttaatcat ttagaatttt gcctccacca tccaccagaa aatgaagtgg 480 

aaagagaagt tgggggatgg tgtctatgac accttcatga tgatagatga aaccaaatgt 540 

cccccctgtt caaatgtact ctgcaatcct tctgaaccac'-ctccacccag aagactaaat 600 

atgaccactg agcagtttac aggagatcat actcagcact ttttggatgg aggtgagatg 660 

aaggtagaac agctgtttca agaatttggc aacagaaaat ccaatactat tcagtcagat 720 

ggcatcagtg actctgaaaa atgctctcct actgtttctc agggtaaaag ttcagattgc 780 

ttgaatacag taaaatccaa cagttcatcc aaggcaccca aagtggtgcc tctgactcca 840 

gaacaagccc tgaagcaata taaacaccac ctcactgcct atgagaaact ggaaataatt 900 

aattatccag aaatttactt tgtaggtcca aatgccaaga aaagacatgg agttattggt 960 

ggtcccaata atggagggta tgatgatgca gatggggcct atattcatgt acctcgagac 1020 

catctagctt atcgatatga ggtgctgaaa attattggca aggggagttt tgggcaggtg 1080 

gccagggtct atgatcacaa acttcgacag tacgtggccc taaaaatggt gcgcaatgag 1140 

aagcgctttc atcgtcaagc agctgaggag atccggattt tggagcatct taagaaacag 1200 

gataaaactg gtagtatgaa cgttatccac atgctggaaa gtttcacatt ccggaaccat 1260 

gtttgcatgg cctttgaatt gctgagcata gacctttatg agctgattaa aaaaaataag 1320 

tttcagggtt ttagcgtcca gttggtacgc aagtttgccc agtccatctt gcaatctttg 1380 

gatgccctcc acaaaaataa gattattcac tgcgatctga agccagaaaa cattctcctg 1440 

aaacaccacg ggcgcagttc aaccaaggtc attgactttg ggtccagctg tttcgagtac 1500 

cagaagctct acacatatat ccagtctcgg ttctacagag ctccagaaat catcttagga 1560 

agccgctaca gcacaccaat tgacatatgg agttttggct gcatccttgc agaactttta 1620 

acaggacagc ctctcttccc tggagaggat gaaggagacc agttggcctc catgatggag 1680 

cttctaggga tgccaccacc aaaacttctg gagcaatcca aacgtgccaa gtactttatt 1740 

aattccaagg gcataccccg ctactgctct gtgactaccc aggcagatgg gagggttgtg 1800 

cttgtggggg gtcgctcacg taggggtaaa aagcggggtc ccccaggcag caaagactgg 1860 

gggacagcac tgaaagggtg tgatgactac ttgtttatag agttcttgaa aaggtgtctt 1920 

cactgggacc cctctgcccg cttgacccca gctcaagcat ... taagacaccc ttggattagc 1980 

aagtctgtcc ccagacctct caccaccata gacaaggtgt 'cagggaaacg ggtagttaat 2040 

cctgcaagtg ctttccaggg attgggttct aagctgcctc cagttgttgg aatagccaat 2100 

aagcttaaag ctaacttaat gtcagaaacc aatggtagta tacccctatg cagtgtattg 2160 

ccaaaactga ttagctagtg gacagagata tgcccagaga tgcatatgtg tatattttta 2220 

tgatcttaca aacctgcaaa tggaaaaaat gcaagcccat tggtggatgt ttttgttaga 2280 

gtagactttt tttaaacaag acaaaacatt tttatatgat tataaaa 2327 



<210> 8 
<211> 568 
<212> PRT 

<213> HOMO SAPIENS 
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Met 


Lys 


Trp 


Lys 


Glu 


Lys 


Leu 


Gly Asp 


Gly Val 


Tyr 


Asp 


Thr 


Phe 


Met 


1 








5 










10 










15 




Met 


He 


Asp 


Glu 
20 


Thr 


Lys 


Cys 


Pro 


Pro 
25 


Cys 


Ser 


Asn 


Val 


Leu 
30 


Cys 


Asn 


Pro 


Ser 


Glu 
35 


Pro 


Pro 


Pro 


Pro 


Arg 
40 


Arg 


Leu 


Asn 


Met 


Thr 
45 


Thr 


Glu 


Gin 


Phe 


Thr 
50 


Gly 


Asp 


His 


Thr 


Gin 
55 


His 


Phe 


Leu 


Asp 


Gly 
60 


Gly 


Glu 


Met 


Lys 


Val 


Glu 


Gin 


Leu 


Phe 


Gin 


Glu 


Phe 


Gly 


Asn 


Arg 


Lys 


Ser 


Asn 


Thr 


He 


65 










70 










75 










80 


Gin 


Ser 


Asp 


Gly 


He 


Ser 


Asp 


Ser 


Glu 


Lys 


Cys/ Ser 


Pro 


Thr 


Val 


Ser 










85 










90 










95 
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Lys 


Ser 
100 


Ser 


Asp 


Cys 


Leu 


Asn 
105 


Thr 


Val 


Lys 


Ser 


Asn 
110 


Ser 


Ser 
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Ser Lys Ala Pro 
115 

Gin Tyr Lys His 
130 

Tyr Pro Glu lie 
145 

Val lie Gly Gly 

Tyr lie His Val 
180 

Lys lie lie Gly 
195 

His Lys Leu Arg 
210 

Arg Phe His Arg 
225 

Lys Lys Gin Asp 

Ser Phe Thr Phe 
260 

lie Asp Leu Tyr 
275 

Val Gin Leu Val 
290 

Ala Leu His Lys 
305 

lie Leu Leu Lys 

Gly Ser Ser Cys 
340 

Arg Phe Tyr Arg 
355 

Pro lie Asp lie 
370 

Gly Gin Pro Leu 
385 

Met Met Glu Leu 

Lys Arg Ala Lys 
420 

Ser Val Thr Thr 
435 

Ser Arg Arg Gly 
450 

Thr Ala Leu Lys 
4 65 

Arg Cys Leu His 

Leu Arg His Pro 
500 

lie Asp Lys Val 
515 

Gin Gly Leu Gly 
530 



Lys Val Val Pro 
120 

His Leu Thr Ala 
135 

Tyr Phe Val Gly 
150 

Pro Asn Asn Gly 
165 

Pro Arg Asp His 

Lys Gly Ser Phe 
200 

Gin Tyr Val Ala 
215 

Gin Ala Ala Glu 
230 

Lys Thr Gly Ser 
245 

Arg Asn His Val 

Glu Leu lie Lys 
280 

Arg Lys Phe Ala 
295 

Asn Lys lie lie 
310 

His His Gly Arg 
325 

Phe Glu Tyr Gin 

Ala Pro Glu lie 
360 

Trp Ser Phe Gly 
37 5 

Phe Pro Gly Glu 
390 

Leu Gly Met Pro 
405 

Tyr Phe lie Asn 

Gin Ala Asp Gly 
440 

Lys Lys Arg Gly 
455 

Gly Cys Asp Asp 
470 

Trp Asp Pro Ser 
485 

Trp lie Ser Lys 

Ser Gly Lys Arg 
520 

Ser Lys Leu Pro 
535 



Leu Thr Pro Glu 

Tyr Glu Lys Leu 
140 

Pro Asn Ala Lys 
155 

Gly Tyr Asp Asp 
170 

Leu Ala Tyr-Arg 
185 

Gly Gin Val Ala 

Leu Lys Met Val 
220 

Glu He Arg He 
235 

Met Asn Val He 
250 

Cys Met Ala Phe 
265 

Lys Asn Lys Phe 

Gin Ser He Leu 
300 

His Cys Asp Leu 
315 

Ser Ser Thr Lys 
330 

Lys Leu Tyr Thr 
345 

He Leu Gly Ser 

i' 

Cys He Leu 'Ala 
380 

Asp Glu Gly Asp 
395 

Pro Pro Lys Leu 
410 

Ser Lys Gly He 
425 

Arg Val Val Leu 

Pro Pro Gly Ser 
460 

Tyr Leu Phe He 
475 

Ala Arg Leu Thr 
4 90 

Ser Val Pro Arg 
505 

Val Val Asn Pro 

Pro Val Val Gly 
540 



Gin Ala Leu Lys 
125 

Glu He He Asn 

Lys Arg His Gly 
160 

Ala Asp Gly Ala 
175 

Tyr Glu Val Leu 
190 

Arg Val Tyr Asp 
205 

Arg Asn Glu Lys 

Leu Glu His Leu 
240 

His Met Leu Glu 
255 

Glu Leu Leu Ser 
270 

Gin Gly Phe Ser 
285 

Gin Ser Leu Asp 

Lys Pro Glu Asn 
320 

Val He Asp Phe 
335 

Tyr He Gin Ser 
350 

Arg Tyr Ser Thr 
365 

Glu Leu Leu Thr 

Gin Leu Ala Ser 
400 

Leu Glu Gin Ser 
415 

Pro Arg Tyr Cys 
430 

Val Gly Gly Arg 
445 

Lys Asp Trp Gly 

Glu Phe Leu Lys 
480 

Pro Ala Gin Ala 
495 

Pro Leu Thr Thr 
510 

Ala Ser Ala Phe 
525 

He Ala Asn Lys 
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Leu Lys Ala Asn Leu Met Ser Glu Thr Asn Gly Ser lie Pro Leu Cys 
545 " 550 555 560 

Ser Val Leu Pro Lys Leu lie Ser 
565 

<210> 9 

<211> 15 

<212> PRT 

<213> HOMO SAPIENS 

<400> 9 

Leu Gly Gly Arg Asp Ser Arg Ser Gly Ser Pro Met Ala Arg Arg 
1 5 10 15 

<210> 10 

<211> 16 

<212> PRT 

<213> HOMO SAPIENS 

<400> 10 

Leu Gly Gly Arg Asp Ser Arg Ser Gly Ser Ala Met Ala Arg Arg Pro 
15 10 15 

<210> 11 

<211> 16 

<212> PRT 

<213> HOMO SAPIENS 

<400> 11 

Leu Gly Gly Arg Asp Ser Arg Ser Gly Thr Pro Met Ala Arg Arg Pro 
1 5 10 ~ 15 

<210> 12 

<211> 16 

<212> PRT 

<213> HOMO SAPIENS 

<400> 12 

Leu Gly Gly Arg Asp Ser Arg Ser Gly Ser Pro-Pro Ala Arg Arg Pro 
15 10 15 

<210> 13 

<211> 16 

<212> PRT 

<213> HOMO SAPIENS 

<400> 13 

Leu Gly Gly Arg Asp Ser Arg Ser Pro Ser Pro Met Ala Arg Arg Pro 
1 5 10 15 

<210> 14 

<211> 16 

<212> PRT 

<213> HOMO SAPIENS 
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<400> 14 

Leu Gly Gly Arg Asp Ser Arg Ser Pro Ser Ala Met Ala Arg Arg Pro 
I 5 10 15 

<210> 15 
<211> 15 
<212> PRT 

<213> HOMO SAPIENS 
<400> 15 

Leu Gly Gly Arg Asp Ser Arg Ala Gly Ser Pro Met Ala Arg Arg 
1 5 10 15 

<210> 16 
<211> 16 
<212> PRT 

<213> HOMO SAPIENS 
<400> 16 

Leu Gly Gly Arg Asp Ala Arg Ser Gly Ser Pro 'Met Ala Arg Arg Pro 
15 10 15 

<210> 17 
<211> 15 
<212> PRT 

<213> HOMO SAPIENS 
<400> 17 

Leu Gly Gly Arg Asp Ala Arg Ala Gly Ser Pro Met Ala Arg Arg 
15 10 15 

<210> 18 
<211> 14 
<212> PRT 

<213> HOMO SAPIENS 



<400> 18 

Asn lie Val Thr Pro Arg Thr Pro Pro Pro Ser Gin Gly Lys 
15 10 
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